AFRL-SR-AR-TR-08-0153 


REPORT  DOCUMENTATION  PAGE 


The  fut'«  rooninfi  burden  tor  cofrciisn  at  mlarmatten  u  ntUrrwtad  to  **•/*$ i  t  hour  f-m  loipoma,  tnch*4n$  to  ...  «n»w»>9  ruvwtwni,  Hxtog  nnlng  data  towcri. 

aattwktg  ana  rn*nt»rung  th*  de t*  rwrdrt,  and  comr**  V»o  and  toviawinf  to  colircikin  at  Weimaile*,  Sand  lornmnu  ragardng  thii  biadan  at  lima  to  or  arty  ctor  aspact  at  th»  cattoctdn  at 
rlaniMn,  r»dw d**f  tuggot tiom  tot  irdyAlno  to  biadcn.  to  Deportment  at  D*(n»r,  V/iMn^an  Haadquarlar*  SrtYt*i,  Dccctorala  lO»  «ntormat«n  OiMitan*  and  Ripoftl  10704-OIUt. 
1211  Jrflnan  Davo  IfigfmdV,  Suito  1204.  ArtMim,  VA  232024002,  Ratportoarttt  IhcuM  ba  bwara  tot  ratwilMtifdrg  any  t  tor  ptobn  «t  law.  no  par  ion  than  ba  tub, act  «  any 
pen  a  ty  lor  t*Vng  to  comply  wltft  a  to  Dae  lion  ol  information  it  it  daai  not  ditpiay  a  cwrtaiMty  vaM  0M8  contra*  numbar. 

PLEASE  DO  NOT  RETURN  YOUR  FORM  TO  THE  ABOVE  ADDRESS. 

1.  REPORT  DATE  fDD-MM-YYYY!  2.  REPORT  TYPE 

02-12-2008  Final 

3.  DATES  COVERED  (from  •  To) 

3/1/04-8/31/07 

4.  TITLE  AND  SUBTITLE 

Tracking  with  a  swarm  of  GMTl  Equipped  UAV’s 

5a.  CONTRACT  NUMBER 

FA9550-04- 1-0141 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHORIS) 

Yaakov  Bar-Shalom 

5d  PROJECT  NUMBER 

5*.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAMEtS)  AND  AODRESS(ES) 

University  of  Connecticut,  Unit  2157 

Storrs,  CT  06269-2157 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

ECE  070212 

$.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES| 

Office  of  Naval  Research 

Boston  Regional  Office 

495  Summer  Street,  Room  627 

Boston,  M A  02210-2109  ^ 

10.  SPONSOR/MONITOR'S  ACRONYMIS) 

11  SPONSOR/MONITOR-S  REPORT 

NUMBER(S) 

12.  DiSTRlBUTION/AVAlLABlLITY  STATEMENT 


Unlimited 


13.  SUPPLEMENTARY  NOTES 


14.  ABSTRACT 


The  problem  considered  in  this  report  is  the  optimization  of  the  information  obtained  by  a  group  of  UAVs  carrying  out  surveillance  - 
-  search  and  tracking  —  over  a  large  region  which  includes  a  number  of  targets.  The  goal  is  to  track  detected  targets  as  well  as 
search  for  the  undetected  ones.  The  UAVs  are  assumed  to  be  equipped  with  Ground  Moving  Target  Indicator  (GMTl)  radars,  which 
measure  the  locations  of  moving  ground  targets  as  weli  as  their  radial  velocities  (Doppler). 


15  SUBJECT  TERMS 


16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 

ABSTRACT 

18.  NUMBER 
OF 

PAGES 

l 

19a.  NAME  OF  RESPONSIBLE  PERSON 

a.  REPORT 

b.  ABSTRACT 

c.  THIS  PAGE 

Yaakov  Bar-Shalom 

13 

1  U 

U 

19b.  TELEPHONE  NUMBER  PM/Mto  m  c odd 

860-486-4823 

Standard  Form  298  IPev  8/98) 

20101014472 


AIR  FORCE  OF;  CE  OF  SCIENTIFIC  R[  SEARCH 


26  FEB  2008 

DTIC  Data 

Page  1  of  2 

Purchase  Request  Number: 

FQ8671 -0600467 

BPN: 

F1ATA05256B467 

Proposal  Number: 

04-NM-039 

Research  Title: 

TRACKING  WITH  A  COOPERATIVELY  CONTROLLED  SWARM  OF  GMTI  EQUIPPED 
UAVS 

Type  Submission: 

Inst.  Control  Number: 

FA9550-04-1  -0 1 4 1 P00003 

Institution: 

UNIVOF  CONNECTICUT 

Primary  Investigator: 

Professor  Yaakov  Bar-Shalom 

Invention  Ind: 

none 

Project/Task: 

2304T  /  B 

Program  Manager: 

Juan  R  Vasquez 

Objective: 

To  develop  an  autonomous  UAV  based  surveillance  of  several  ground  targets  distributed  over  a  large  area 


Approach: 

To  develop  a  cooperative  control  algorithm  where  the  coupling  of  individual  UAVs  is  not  through  their  dynamics,  but  rather 
through  their  decision  making  algorithms  that  used  target  tracking  metrics  for  their  performance 

Progress: 

Year:  2004  Month:  11 

ANNUAL  REPORT  FOR:  FA9550-04-1-0141 

Status  of  Effort: 

An  algorithm  for  Optimal  UAV  placement  for  Suveillance  was  obtained  and  published  in  [273,  284], 

The  algorithm  for  optimal  UAV  placement  for  surveillance  can  improve  their  utilization  [273,  284] 

Year:  2005  Month:  09 
Annual  Report  for  FA9550-04-1-0141 

Our  current  work  deals  with  the  development  of  practical  advanced  algorithms  for  the  optimal  cooperative  placement  of 
UAVs  with  GMTI  (Doppler)  radar  for  the  purposes  of  surveillance  and  ground  target  tracking. 

Year:  2006  Month:  10 

Development  of  an  autonomous  UAV  based  surveillance  of  several  ground  targets  distributed  over  a  large  area. 
Development  of  a  cooperative  control  algorithm  where  the  coupling  of  individual  UAVs  is  not  through  their  dynamics,  but 
rather  through  their  decision  making  algorithms  that  used  target  tracking  metrics  for  their  performance  An  algorithm  for 
Optimal  UAV  placement  for  surveillance  was  obtained  and  published  Current  work  deals  with  the  development  of 
practical  advanced  algorithms  for  placement  of  UAVs. 

Year:  2008  Month:  02  Final 

The  problem  considered  in  this  paper  is  the  optimization  of  the  information  obtained  by  a  group  of  UAVs  carrying  out 
surveillance  of  several  ground  targets  distributed  over  a  large  area  The  UAVs  are  assumed  to  be  equipped  with  Ground 


Optimal  Placement  of  GMTI  UAVs  for  ground  target  tracking 


Abhijit  Sinhaa,  Thia.  Kirubarajan*  and  Yaakov  Bar-Shalom6 


a  Electrical  and  Computer  Engineering  Dept. 
McMaster  University 
Hamilton,  ON  L8S  4K1 ,  Canada 


bElectrical  and  Computer  Engineering  Dept. 
University  of  Connecticut 
Storrs,  CT  06269,  USA 


Abstract — With  the  recent  advent  of  moderate-cost  unmanned 
(or  uninhabited)  aerial  vehicles  (UAV)  and  their  success  in 
surveillance,  it  is  natural  to  proceed  to  consider  the  coop¬ 
erative  management  of  groups  of  UAVs.  The  problem  con¬ 
sidered  in  this  paper  is  optimization  of  the  information  ob¬ 
tained  by  a  group  of  UAVs  carrying  out  surveillance  of  sev¬ 
eral  ground  targets  distributed  over  a  large  area.  The  UAVs 
are  assumed  to  be  equipped  with  Ground  Moving  Target  Indi¬ 
cator  (GMTI)  radars,  which  measure  the  locations  of  moving 
ground  targets  as  well  as  their  radial  velocities.  In  this  re¬ 
search  entropic  information,  obtained  from  the  information 
form  of  Riccati  equation,  is  used  as  the  criterion  function. 
Sensor  survival  probability  and  target  detection  probability 
for  each  target-sensor  pair  are  also  included  in  the  criterion 
function.  The  optimal  sensor  placement  problem  is  solved 
via  deterministic  as  well  as  randomized  optimization.  Simu¬ 
lation  results  on  two  different  scenarios  are  presented  for  four 
different  types  of  prior  information. 
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1.  Introduction 

Sensors  like,  for  example,  radars  and  MTI,  are  used  to  make 
observations  of  a  moving  target  with  the  objective  of  estimat¬ 
ing  the  state  of  the  target.  Multiple  sensors,  which  give  differ¬ 
ent  perspectives  of  one  or  more  targets  at  the  same  time  or  at 
different  times,  can  be  used  to  enhance  estimation  results.  An 
important  application  of  control  theory  is  to  manage  multi¬ 
ple  sensors  such  that  the  expected  information  obtained  from 
them  can  be  maximized.  Management  of  multiple  sensors 
involves  gathering,  exchanging  and  combining  information. 
When  the  sensor  platforms  are  mobile  one  has  to  decide  the 
optimal  placement  of  sensors.  With  the  recent  advent  of  af¬ 
fordable  unmanned  aerial  vehicles  (UAV)  and  their  proven 
effectiveness  in  surveillance,  it  is  natural  to  consider  the  co¬ 
operative  management  of  groups  of  UAVs. 

A  number  of  UAV  management  algorithms  can  be  found  in 
the  literature.  In  [3]  a  hierarchical  approach,  that  uses  modi¬ 
fied  Voronoi  diagram  to  generate  possible  paths,  to  intercept 


a  number  of  known  targets  using  a  number  of  UAVs,  is  pre¬ 
sented  Similar  approaches  can  be  found  in  [7],  [8].  A  search 
algorithm  for  targets  in  a  given  area  is  proposed  in  [5],  where 
a  discrete  time  stochastic  decision  model  is  formulated  as  the 
path  planning  problem,  which  is  then  implemented  with  a  dy¬ 
namic  programming  algorithm  [9].  However,  the  aim  of  [5] 
is  only  to  detect  (not  to  track)  targets  in  the  search  region. 
In  [4]  a  decentralized  sensor  management  algorithm  is  pre¬ 
sented.  In  this  paper  an  entropic  information  measure  is  used 
as  the  objective  function  in  a  target  cuing  and  target  hand-off 
problem  and  a  multi-platform  bearing  only  tracking  problem. 

In  this  research  we  use  an  information  theoretic  approach 
similar  to  [4].  Here  the  GMTI  radars  measures  radial  ve¬ 
locity  as  well  as  position  of  the  targets  Also,  the  survival 
probability  of  a  sensor  from  hostile  fire  by  the  targets  and  de¬ 
tection  probability  of  a  target  corresponding  to  a  particular 
sensor  are  considered.  Development  of  the  criterion  function 
is  discussed  in  Section  2.  In  Section  3  different  approaches, 
both  randomized  and  deterministic,  to  find  the  maxima  of  the 
criterion  function,  are  discussed  and  a  mixed  approach  is  de¬ 
veloped.  Simulation  results  are  presented  in  Section  4 

2.  Criterion  Function 

The  criterion  function,  obtained  from  the  entropic  informa¬ 
tion  measure  [4],  is  given  by 

J'  =  E1°  g(IA(*l*)_ll)  (>) 

j 

where  Pj(k\k)  is  the  posterior  covariance  of  the  state  vec¬ 
tor  corresponding  to  target  j  at  time  t *  and  can  be  written  in 
terms  of  the  state  prediction  covariance  Pj(k\k  —  1)  and  new 
information  Yj(k)  as  follows  [1] 

Pjiklk)-1  =  P,(k\k  - 1)-1  +  Yj{k)  (2) 

The  matrix  Yj{k)  is  the  total  new  information  about  target  j 
obtained  by  different  sensors  The  information  obtained  by  a 
particular  sensor  s  about  target  j  is  given  by 

Y(k,s,j)  =  H(k,8,j)'R{s,j)-lH(k,8,j)  (3) 

where  H(k,  s,j)  is  the  measurement  matrix  and  i?(s,  j)  is  the 
measurement  covariance  matrix  corresponding  to  the  sensor- 
target  pair  s-j.  The  new  information  also  depends  on  target’s 
detection  probabilities  corresponding  to  each  sensor  and  the 
survival  probability  of  the  sensors.  The  new  information  ob¬ 
tained  by  sensor  s  about  target  j  is  reduced  depending  on  this 
sensor’s  survivable  probability.  The  modified  information  is 
given  by 


Y(k,sJ)  =  7rs(s)Y(k,s,j) 


(4) 
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where  ^5(5)  is  the  total  survival  probability  of  sensor  5, 
which  is  equal  to  the  product  of  survival  probabilities  of  this 
sensor  corresponding  to  each  target. 

^(s) = @) 
5 

Let  us  assume  that  the  target  j  is  detected  exactly  by  a  set  C 
of  sensors.  The  probability  of  such  an  event  is  given  by 

kd(C,  j)  =  (6) 

a 

where  J(C,  5)  is  an  indicator  given  by 

<7> 

Total  information  gain  when  a  target  j  is  detected  exactly  by 
a  set  C  of  sensors  is  given  by 

=  (8) 

a€C 

Hence  considering  all  possible  sets  similar  to  C  the  updated 
information  measure  can  be  written  as 

Pjiklk)-1  =  Pjm-l)-1  +Y/*D(CJ)Y(k,C,j) 

vc 

(9) 

VC 

■Y/^s(s)H(k,s,jYR(s,j)-lH(k,s,j) 

*€C 

(10) 


It  can  be  shown  that  (10)  reduces  to 

=  Pj(fc|fc  -  1)~*  4-  y^7T s{s)np(sj) 

a 

■H  (fc,  s,j)'R(s,  j)~l  ff(k,s,j)  (11) 

Figure  1  shows  a  notional  survival  probability  and  detection 
probability  vs.  range  between  the  target  and  the  sensor.  It  is 
important  to  note  that  in  a  real  life  scenario  the  survival  and 
detection  probability  depend  on  the  terrain  topography. 


The  state  vector  of  target  j  is  given  by 

V  =  [  XJ  1 i  y3  v>i  ]  (12) 

where  x  and  y  are  the  position  components  in  cartesian  coor¬ 
dinate  and  vX9  vy  are  the  velocity  components.  The  approx¬ 
imate  measurement  matrix,  which  comprises  of  x-y  position 
and  radial  velocity  r,  of  target  j  corresponding  to  a  sensor  s, 
is  given  by 


H(k,s,j) 


10  0  0 
0  0  10 
0  cos(a(s,j))  0  sin(a(s,j)) 


(13) 


where  a(s,  j)  is  the  azimuth  angle  of  the  target  j  measured  by 
sensor  s.  The  original  position  measurement  are  in  form  of 


Figure  1.  Detection  probability  and  survival  probability 
w.r.t.  range 


range  r(s,  j)  and  azimuth  a(s,  j)  which  are  converted  to  x-y 
position  using  the  standard  conversion  [1].  The  measurement 
covariance  matrix  R(s,j)  is  given  by 


where 


R(s,j) 


R\,i  Ri,2  0 

R\,2  -^2,2  0 

O'  0  cr? 


(14) 


Ri,i  =  r(s,j)2als\n{a(s,j))2  +  atcos(a(s,j))?\5) 

R2,2  =  r{s>j)2a\  cos(a(s,j))2  +  a2 sin(a(s,  j))^16) 

R\,2  =  (cr?  ~  r(s,  j)2ul)  sin(a(s,  j))cos(a(s,  j)) 

(17) 


3.  Search  Techniques 

In  this  work  we  have  considered  both  gradient  based  search 
technique  as  well  as  randomized  search  technique  to  find  the 
maxima  of  the  criterion  function. 


Newton-Raphson  Method — The  Newton-Raphson  method  is 
a  deterministic  technique  in  which  one  step  update  is  given 
by 

snew=s  \SJ  (18) 

where  s  is  a  vector  that  denotes  current  x-y  positions  of  the 
sensors,  V8  J  denotes  the  gradient  of  the  criterion  function 
w.r.t.  s  and  is  a  factor  that  is  decided  by  a  line  search  tech¬ 
nique.  Thus,  the  Newton-Raphson  method  requires  the  first 
and  second  derivatives  of  the  criterion  function  J  w.r.t.  x-y 
position  of  each  radar.  For  each  sensor-target  pair  s-t ,  the 
derivative  of  Ys,t{k)  is  first  evaluated  w.r.t.  range  r(s,  t)  and 
azimuth  a (5,  £)  and  then  converted  to  derivatives  w.r.t.  the  x- 
y  position  of  the  sensor.  Using  these  derivatives  V*  J  and 
VgJ  are  obtained.  A  detailed  discussion  of  the  procedure 
can  be  found  in  Appendix.  Although  the  Newton-Raphson 
method  produces  very  fast  convergence,  it  fails  in  the  pres¬ 
ence  of  local  maxima,  as  is  the  case  for  the  criterion  function 


J.  However,  it  can  be  very  useful  to  refine  the  results  of  other 
algorithms,  which  is  discussed  later  in  this  section. 

Genetic  Algorithm — This  is  a  randomized  technique  suitable 
for  problems  with  multiple  optimal  points.  To  apply  this  al¬ 
gorithm  in  the  current  problem  the  positions  of  sensors  are 
discretized  to  one  meter  resolution  and  converted  to  a  bit 
string.  A  population  size  of  100  is  used  in  this  application. 
First  generation  is  obtained  randomly  and  next  generations 
are  obtained  by  crossover  operation,  which  is  applied  sepa¬ 
rately  on  the  x  and  y  positions  of  each  sensor.  The  parents 
are  selected  randomly  depending  on  their  fitness,  which  is  a 
shifted  version  of  the  value  of  criterion  function  correspond¬ 
ing  to  the  sensor  positions  indicated  by  the  parent.  Along 
with  the  crossover  operation,  the  bits  of  the  candidates  in  the 
next  generation  can  also  change  due  to  mutation  operation.  In 
this  application  the  mutation  probability  of  each  bit  is  1/30. 
Elitism  was  used  to  preserve  best  parents  in  the  next  genera¬ 
tion. 


which  depends  on  the  revisit  time  A i  =  ti+\  —  ti,  is  given  by 


Fj(l)  = 


1  A,  0  0 

0  10  0 
0  0  1  A, 

0  0  0  1 


(21) 


The  process  noise  matrix  Q(l)  is  obtained  following  a  dis¬ 
crete  white  noise  acceleration  model 


<3(0  = 


?A' 

}a? 


0 

0 


IA? 

A? 


0 

0 


0 

0 


iA4 

4  “l 

Ia3 

2al 


0 

0 


5  A? 

A? 


(22) 


where  o„  is  the  variance  of  the  white  noise  acceleration  pro¬ 
cess.  In  our  simulations  ov  =  0.5  m/sec  One  important 
property  of  information  form  of  Riccati  equation  is  that  the 
information  from  different  sensors  can  be  written  in  the  sum¬ 
mation  form  as  in  (20). 


This  algorithm  eventually  reaches  the  global  maxima  without 
being  trapped  by  the  local  ones.  However,  rate  of  conver¬ 
gence  of  this  algorithm  slows  down  when  it  reaches  close  to 
the  maxima.  For  fast  convergence  and  to  avoid  the  method 
being  trapped  by  any  local  maxima  we  choose  a  combina¬ 
tion  of  the  search  techniques  discussed  above.  Initial  search 
is  performed  randomly  using  the  genetic  algorithm  and  then 
the  best  result  of  this  search  is  used  as  the  initial  point  for  the 
Newton-Raphson  method. 

4.  Results 

In  this  section  we  present  the  results  obtained  on  two  target 
scenarios  using  the  search  technique  discussed  in  Section  3 
for  the  criterion  function  discussed  in  Section  2.  Both  of  the 
target  scenarios  consist  of  two  sets  having  three  target  each 
and  one  set  having  four  targets.  In  the  first  type  of  scenario 
the  target  sets  form  a  triangular  shape  (Figure  2)  and  in  the 
second  one  the  targets  form  a  linear  shape  (Figure  6).  In  all 
scenarios  the  height  of  the  sensors  above  the  ground  is  con¬ 
sidered  to  be  1  km  and  the  ground  is  considered  to  be  flat. 
The  measurement  noise  standard  deviations  are  oT  —  5  m, 
as  =  1°  and  =  1  m/sec.  To  evaluate  the  criterion  function 
a  state  prediction  covariance  matrix  Pj(k\k  —  1)  is  required 
for  each  target  j.  Steady  state  value  of  Pj(k\k  -  1)  can  be 
used,  which  is  obtained  using  the  information  filter  form  of 
Riccati  equation  given  by 

P^  +  lIZ)-1  =  AjW-AMiA^  +  Qiiy'y'Ajil) 

(19) 


and 


Aj(l) 


(FU)-1)'  (PjW-i)-1 


where  Cj(l)  is  the  set  of  sensors  that  detects  target  j  at  time 
tt  and  F(l)  is  the  transition  matrix  between  time  ti  to  ti+ 1, 


Figure  2.  Scenario  for  the  calculation  of  the  steady  state 
prediction  covariance  matrix 


The  steady  state  covariance  matrix  is  obtained  using  the  sce¬ 
nario  shown  in  Figure  2.  For  simplicity  it  is  assumed  that  the 
targets  move  in  the  same  direction  as  the  sensors  and  their 
relative  position  remains  the  same  over  time.  The  revisit  time 
is  assumed  to  be  A/  =  10  sec  for  all  £/.  The  resulting  steady 
state  prediction  covariance  matrices  are  different  for  differ¬ 
ent  target.  It  is  also  observed  that  the  correlations  between  x 
component  of  position  or  velocity  and  y  component  of  posi¬ 
tion  or  velocity  vary  in  a  wide  range  and  can  have  both  pos¬ 
itive  and  negative  values.  However,  correlations  between  the 
position  and  velocity  components  of  any  particular  direction 
(x  or  y)  are  very  close  to  1/3.  The  following  steady  state  pre¬ 
diction  covariance  matrix,  which  shows  an  average  behavior, 
was  selected  for  each  target 

±10  0  0 

1  0  0 

0  102  ±10 

0  ±10  1 


Pi  = 


102 

jio 

0 

0 


(23) 


We  can  also  observe  the  effect  of  a  state  prediction  covariance 
matrix,  which  is  far  from  the  steady  state,  on  optimal  sensor 
positions.  The  following  matrix  is  used  for  this  purpose 


P2 


1002  ±500  0  0 

j500  52  0  0 

0  0  1002  j500 

0  0  ±500  52 


(24) 


The  following  two  state  prediction  covariance  matrices  are 
used  to  obtain  optimal  sensor  positions  when  variances  are 
high  in  one  direction  (z  or  y)  and  low  in  another. 


^3 


P4 


'  102  ±10  0  0 

£10  l2  0  0 

0  0  1002  ±500 

0  0  ±500  52 

1002  ±500  0  0 

±500  52  0  0 

0  0  102  ±10 

0  0  ±10  l2 


(25) 


(26) 


Figures  3-8  show  the  optimal  sensor  positions  when  the 
steady  state  prediction  covariance  matrix,  which  is  shown  in 
(23),  is  used  in  the  criterion  function.  These  figures  show 
that  for  both  scenarios  the  sensors  concentrate  near  the  largest 
group  (particularly  in  Figures  3  and  4).  Since  in  this  case  the 
predicted  target  state  is  highly  accurate,  some  of  the  targets 
can  be  neglected  while  placing  the  sensors.  Optimal  target 
positions  are  presented  in  Figures  9-1 1  when  state  predic¬ 
tion  covariance  matrix,  shown  in  (24),  is  used.  This  matrix 
represents  much  more  noisy  prediction  than  at  steady  state. 
Hence,  none  of  the  targets  can  be  neglected  in  this  case.  Fig¬ 
ures  12-14  show  optimal  sensor  positions  when  predictions 
are  relatively  bad  in  y  direction  as  presented  in  (25).  The  fig¬ 
ures  show  that  the  sensors  align  along  the  y  axis  relative  to 
the  targets  to  get  better  position  and  velocity  information  in 
this  direction.  Opposite  of  this  effect  can  be  seen  in  15-17, 
where  due  to  relatively  less  predicted  information  along  z  the 
sensors  align  themselves  along  this  direction. 

5.  Conclusions 

In  this  paper  an  information  theoretic  approach  for  opti¬ 
mal  GMTI  sensor  placement  is  presented.  The  sensors, 
that  measure  target’s  position  along  with  its  radial  veloc¬ 
ity,  are  mounted  on  UAVs  that  are  tracking  a  number  of 
ground  targets.  Target’s  detection  probability  and  sensor’s 
survival  probability  are  considered  while  developing  the  cri¬ 
terion  function.  A  combination  of  randomized  and  nonran- 
domized  techniques  is  used  to  find  the  optimal  sensor  loca¬ 
tions.  Results  are  obtained  for  two  scenarios  each  having  10 
targets  and  for  different  number  of  sensors.  Also,  different 
state  prediction  covariances  are  used  to  observe  their  effect. 
The  overall  track  estimation  accuracy  should  increase  signif¬ 
icantly  with  this  approach.  The  extension  of  this  work  to  a 
dynamic  scenario  is  under  investigation. 
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Figure  3.  Optimal  position  of  2  sensors  for  target  scenario  1 
and  steady  state  value  of  state  prediction  covariance 


Figure  4.  Optimal  position  of  3  sensors  for  target  scenario  1 
and  steady  state  value  of  state  prediction  covariance 


6.  Appendix 

In  the  following,  the  derivations  required  for  the  Newton 
Raphson  method  are  presented.  An  approximation  is  made  by 
assuming  that  the  cross  terms  in  the  second  derivative  of  the 
criterion  function,  related  to  two  separate  sensors,  are  all  zero. 
This  significantly  reduces  the  computational  requirements  by 
allowing  separate  update  for  the  sensors.  For  a  sensor  i  one 
step  update  is  given  by 

s‘new  =  s‘  — /?(V2  (A.l) 

where  s*  is  the  current  x-y  position  of  the  sensor 

s*  =  [  y<  ]  (A.2) 
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Figure  5.  Optimal  position  of  4  sensors  for  target  scenario  1  Figure  7.  Optimal  position  of  3  sensors  for  target  scenario  2 
and  steady  state  value  of  state  prediction  covariance  and  steady  state  value  of  state  prediction  covariance 


x  (km) 

Figure  6.  Optimal  position  of  2  sensors  for  target  scenario  2 
and  steady  state  value  of  state  prediction  covariance 


Figure  8.  Optimal  position  of  4  sensors  for  target  scenario  2 
and  steady  state  value  of  state  prediction  covariance 


The  criterion  function  is  given  by 


■^=Elos( 


Pt(k\k  -  1)  1  +  ^7rs(s)7T£>(s,t)y(A:,  s,  t) 


(A.3) 

Information  of  a  target  j  obtained  by  a  sensor  i ,  assuming 
perfect  detection  and  survival,  is  given  by 


Y(k,ij)  = 
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where  the  components  of  matrix  Y(k,i,j),  obtained  using 
(13)  and  (14),  are  given  by 
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Figure  9  Optimal  position  of  2  sensors  for  target  scenario  1  n  •  ,  ...  r 

. ,  .  ,  %  ~  j.  •  Figure  11.  Optimal  position  of  2  sensors  for  target  scenario 

and  h.gh  value  of  state  pred.ct.on  covanance  2  and  high  value  of  state  pred.ction  covariance 
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Figure  10.  Optimal  position  of  3  sensors  for  target  scenario 
1  and  high  value  of  state  prediction  covariance 


1  and  a  state  prediction  covariance  that  has  low  value  along 
x-direction  and  high  value  along  y-direction 


The  components  can  be  expanded  as 


The  derivatives  of  the  criterion  function  w.r.t.  the  x-y  position 
of  sensor  i  are  given  by 
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Figure  13.  Optimal  position  of  2  sensors  for  target  scenario  Figure  15  Optimal  position  of  2  sensors  for  target  scenario 

2  and  a  state  prediction  covariance  that  has  low  value  along  1  an^  a  state  prediction  covariance  that  has  high  value  along 

x-direction  and  high  value  along  y-direction  x-direction  and  low  value  along  y-direction 


Figure  14.  Optimal  position  of  4  sensors  for  target  scenario  Figure  16  0ptima,  positjon  of  3  sensors  for  ^nano 
2  and  a  state  prediction  covariance  that  has  low  value  along  j  and  a  state  prediction  COvariance  that  has  high  value  along 

x-direction  and  high  value  along  y-direction  x-direction  and  low  value  along  y-direction 


where  Pt(k\k)  is  given  by 

Pt(fc|fc)-1  =  Pt(k\k  -  l)-1  +  Y,  ^s(s)i^d(s,  t)Y (fc,  s,  t) 

(A. 16) 

the  terms  and  are  similar  to  (A.  13)  and  (A.  14),  re¬ 
spectively. 

To  obtain  derivatives  of  the  determinant  of  a  matrix  we  need 
to  compute  derivatives  of  the  matrix.  First,  these  derivatives 
are  obtained  w.r.t.  r(i,j)  and  a(i,  j)9  which  are  the  polar  co¬ 
ordinates  of  target  j  considering  sensor  i  as  the  origin.  Next, 
these  derivatives  are  converted  to  derivatives  w.r.t.  xl  and  yl. 
It  is  important  to  note  that  the  detection  and  survival  proba¬ 


bilities  are  independent  of  azimuth  angle  a.  We  have 
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Figure  17.  Optimal  position  of  4  sensors  for  target  scenario 
2  and  a  state  prediction  covariance  that  has  high  value  along 
^-direction  and  low  value  along  y-direction 
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The  matrices  A  and  B  are  symmetric  and  the  nonzero  com¬ 
ponents  of  these  matrices  are  given  by 
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The  derivatives  w.r.t.  x-y  can  be  obtained  using  the  following 


conversions 


d 

dr(i,j)  d 

da{i,j) 

d 

(A.39) 

dx * 

dx i  dr(i,j) 

dx' 

da{i,j) 

d 

dy * 

dr(i,j)  d 
dy'  dr(i,j) 

da{i,j) 

dy' 

d 

da{i,j ) 

(A  .40) 

d2 

d2r(i,j)  d 

,  fdr(i, 

j)\2 

d2 

dx *2  dxi2  dr(i,j)  \  dx*  )  dr(i,j )2 

2dr(i,j)da(i,j)  d2 

dx1  dx *  <9r(i,7')da(i,7) 

dx*'2  da(i,j)  V  9x*  y  da{i,j)2 

(A.41) 

d2  _  g2r(jj)  a  f  dr(t,j)  \ 2  a2 

dy*2  dy’2  dr(i,j)  V  dy*  /  dr{i,j)2 

dr(i,j)da{i,j)  d2 

dy *  <9y£  <9r(i,  j)<9a(2,  j) 

,  ^(m)  £  /c>c(z,i)\2  a2 

dy2  da{ij)  V  dy’  /  da(i,j)2 

(A  .42) 

d2  _  ^H^j)  d  ^  drji.  j)  drji,  j)  d2 

dx’dy*  dx'dy'  dr(i,j)  dx1  dy ’  dr(i,j)2 

^drji^daiij)  d2 

<9x*  <9?/*  <9r(i,  j)<9^(2,  j) 

a2<*(*,j)  a 

dxidyi  da(i,j) 

^da(i,j)dr(i,j)  d 2 

dx*  dy*  dr(i,j)da(i,j) 

da(t,j)  da(i,j)  d2 

dy*  da(i,j)2 

(A.43) 


The  partial  derivatives  of  r(2,  j)-a(2,j)  w.r.t.  x*-yl  are  given 
by 
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The  derivatives  of  ns{i,j)  and  ^£>(2,7)  are  depends  on  the 

choice  of  these  functions. 
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ABSTRACT 

With  the  recent  advent  of  moderate-cost  unmanned  (or  uninhabited)  aerial  vehicles  (UAV)  and  their  success  in 
surveillance,  it  is  natural  to  consider  the  cooperative  management  of  groups  of  UAVs.  The  problem  considered  in 
this  paper  is  the  optimization  of  the  information  obtained  by  a  group  of  UAVs  carrying  out  surveillance  of  several 
ground  targets  distributed  over  a  large  area.  The  UAVs  are  assumed  to  be  equipped  with  Ground  Moving  Target 
Indicator  (GMTI)  radars,  which  measure  the  locations  of  moving  ground  targets  as  well  as  their  radial  velocities 
(Doppler).  In  this  research  the  Fisher  information,  obtained  from  the  information  form  of  Riccati  equation,  is  used 
in  the  objective  function.  Sensor  survival  probability  and  target  detection  probability  for  each  target-sensor  pair  are 
also  included  in  the  objective  function,  where  the  detection  probability  is  a  function  of  both  range  and  range  rate. 
The  optimal  sensor  placement  problem  is  solved  by  a  genetic  algorithm  based  optimizer.  Simulation  results  on  two 
different  scenarios  are  presented  for  four  different  types  of  prior  information. 

Keywords:  Multisensor- multi  target  tracking,  sensor  management,  cooperative  control,  UAV  placement,  ground 
target  tracking. 


1.  INTRODUCTION 

GMTI  radars  are  used  to  make  observations  of  moving  targets  with  the  objective  of  estimating  the  target  states. 
Multiple  sensors,  which  give  different  perspectives  of  one  or  more  targets  at  the  same  time  or  at  different  times, 
can  be  used  to  enhance  the  estimation  results.  An  important  application  of  control  theory  is  to  manage  multiple 
sensors  such  that  the  expected  information  obtained  from  them  is  maximized.  Management  of  multiple  sensors 
involves  gathering,  exchanging  and  combining  information.  When  the  sensor  platforms  are  mobile,  one  has  to  decide 
the  optimal  placement  of  sensors.  With  the  recent  advent  of  affordable  unmanned  aerial  vehicles  (UAV)  and  their 
proven  effectiveness  in  surveillance,  it  is  natural  to  consider  the  cooperative  management  of  groups  of  UAVs. 

A  number  of  UAV  management  algorithms  can  be  found  in  the  literature.  In  [3]  a  hierarchical  approach,  which 
uses  modified  Voronoi  diagram  to  generate  possible  paths  and  to  intercept  a  number  of  known  targets  using  a  number 
of  UAVs,  is  presented.  Similar  approaches  can  be  found  in  [7,  8].  A  search  algorithm  for  targets  in  a  given  area  is 
proposed  in  [5],  where  a  discrete  time  stochastic  decision  model  is  formulated  as  the  path  planning  problem,  which 
is  then  implemented  with  a  dynamic  programming  algorithm  [9].  However,  the  aim  of  [5]  is  only  to  detect  (not  to 
track)  the  targets  in  the  search  region.  In  [4]  a  decentralized  sensor  management  algorithm  is  presented  based  on  the 
change  in  entropy.  In  this  paper  the  Fisher  information  measure  is  used  in  the  objective  function  in  a  target  cuing 
and  target  hand-off  problem  and  a  multi-platform  bearing  only  tracking  problem. 

In  a  recent  work  [10]  we  have  used  a  Fisher  information  based  approach  somewhat  similar  to  [4].  Here  the  GMTI 
radars  measure  radial  velocity  as  well  as  position  of  the  targets.  Also,  the  survival  probability  of  a  sensor  from 
hostile  fire  by  the  targets  and  detection  probability  of  a  target  corresponding  to  a  particular  sensor  are  considered. 
In  [10]  the  detection  probability  of  a  target  by  a  radar  is  assumed  to  be  dependent  only  on  the  range.  However,  for 
GMTI  radar  the  detection  probability  is  a  function  of  both  range  and  range  rate.  In  this  paper  the  corresponding 
modification  is  done  and  it  is  found  that  target  heading,  which  can  change  the  range  rate  in  an  otherwise  constant 
targe t^sensor  geometry,  influences  the  optimal  sensor  position.  Development  of  the  objective  function  is  discussed  in 
Section  2.  In  Section  3  a  genetic  algorithm  to  find  the  maximum  of  the  objective  function  is  discussed.  Simulation 
results  are  presented  in  Section  4. 
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2.  OBJECTIVE  FUNCTION 


The  objective  function,  based  on  the  Fisher  information  measure  is  the  product  of  uncertainty  volumes  [10] 

J  =  £log|Pj(fc|*)-1|  (1) 

j 

where  Pj(k\k)  is  the  posterior  covariance  matrix  of  the  state  vector  corresponding  to  target  j  at  time  tk  and  can  be 
written  in  terms  of  the  state  prediction  covariance  Pj(k\k  —  1)  and  new  information  Yj(k)  as  follows  [1] 

Pj(fclfc)"1  =  Pj(k\k  -  IT1  +  Yj(k)  (2) 

The  matrix  Y3(k)  is  the  total  new  information  about  target  j  obtained  by  the  different  sensors.  The  information 
obtained  by  a  particular  sensor  s  about  target  j  is  given  by 

Y(k,s,j)  =  H(k,s,j)'R(k,s,j)-1H(k,s,j)  (3) 

where  H(k,s,j)  is  the  measurement  matrix  and  R(k,  s,j)  is  the  measurement  covariance  matrix  corresponding  to 
the  sensor-target  pair  s,  j.  The  new  information  also  depends  on  the  target’s  detection  probabilities  corresponding 
to  each  sensor  and  the  survival  probability  of  the  sensors.  The  new  information  obtained  by  sensor  s  about  target  j 
is  reduced  depending  on  this  sensor’s  survival  probability.  The  modified  information  is  given  by 

Y(k,s,j)=7rs(s)Y(k,s,j)  (4) 


where  ns($)  is  the  total  survival  probability  of  sensor  s,  which  is  equal  to  the  product  of  survival  probabilities  of  this 
sensor  in  view  of  each  target,  i.e., 

ns(s)  =  P[7rs(s,j)  (5) 

j 

Let  us  further  assume  that  target  j  is  detected  exactly  by  a  set  C  of  sensors.  The  probability  of  such  an  event  is 
given  by 

vD(C,j)  =  n  *d(sJ)HC's)(1  -iTD(sJ)y-/(Ctt>  (6) 


where  7r d(s,j)  is  the  detection  probability  of  target  j  by  sensor  s  and  /(C,  s)  is  an  indicator  function  given  by 


/(C,»)  =  {  J 

The  total  information  gain  when  target  j  is  detected  exactly  by  a  set  C  of  sensors  is  given  by 


if  s  G  O 
0  else 


Y{k,C,j)  =  ^2t{k,s,j) 

s£C 


(7) 


(8) 


Hence,  considering  all  possible  sets,  similar  to  C,  the  updated  information  matrix  can  be  written  as 

=  pi(k\k-i)-i+'Eii'D{cj)f(k,c,j)  (9) 

VC 

=  PjWk-l)-1  +  J2nD(C,j)J2*s(s)H(k,s,jyR(k,s,j)-1H(k,s,j)  (10) 


VC 


sGC 


It  can  be  shown  that  (10)  reduces  to 

=  Pj(k\k -l)~' +Y27rs(s)TrD(s,j)H{k,sJ)'R(k,s,j)~iH(k,s,j)  (11) 


The  state  vector  of  target  j  is  taken  as 


X  =  [  Xi  vi  u>  v{  ] 


(12) 


Figure  1:  Detection  probability  and  survival  probability  w.r.t.  range. 


where  x  and  y  are  the  position  components  in  Cartesian  coordinate  and  vxy  vy  are  the  velocity  components.  The 
approximate  measurement  matrix,  which  comprises  x-y  position  and  radial  velocity  r,  of  target  j  corresponding  to 
sensor  s,  is  given  by 


H{k ,  a,j) 


10  0  0 
0  0  10 
0  cosa(fc,  s,  j)  0  sin  a(k,s,j) 


(13) 


where  a(/c,  s,  j)  is  the  azimuth  angle  of  the  target  j  measured  by  sensor  s  at  time  t The  original  position  measure¬ 
ments  are  in  the  form  of  range  r(k,  s,  j)  and  azimuth  a(k ,  s,  j)  which  are  converted  to  x,y  position  using  the  standard 
conversion  [1].  The  measurement  covariance  matrix  R(k,$yj)  is  given  by 


where  (skipping  the  arguments) 


R(k,s,j) 


Rl,l  Ri,2  0 
Rl,2  R22  0 

0  0  a? 


2  2  •  2  2  2 
Ri.i  —  t  afflsinQ  +arcosa 

R22  =  r2a^  cos  a2  +  a2  sin  a2 

R1.2  =  (a2  —  r2a2)  sin  a  cos  a 


(14) 


(15) 

(16) 
(17) 


The  detection  probability  of  target  j  by  sensor  s  can  be  approximated  as 

kd(sJ)  =  Kb{s,j)ir2D{s,j)  (18) 

where  7r^(s,j)  is  the  detection  probability  as  a  function  of  the  range  and  is  the  detection  probability  as  a 

function  of  the  range  rate.  The  nature  of  the  first  term  is  shown  in  Figure  1  and  the  derivation  of  the  second  term 
is  discussed  in  the  following. 

For  a  GMTI  radar  if,  the  magnitude  of  the  measured  value  of  the  range  rate  for  a  target  falls  below  a  threshold 
rmin  then  the  target  will  not  be  detected.  Hence,  n ^(s,  j)  is  given  by 

*2d(sJ)  =  1  -p{-rmin  <  r{k,s,j)  <  rmin  |  r(k,s,  j|fc  - 1), 0f(k,s, j\k  -  l)2}  (19) 

where  f(k,  s,  j)  is  the  measured  range  rate  and  f(fc,  s,j\k  —  1)  is  the  predicted  range  rate  given  by 

f(k ,  s,  j\k  —  1)  =  —  l)cos  a(k,  s,  j\k  —  1)  +  vl{k\k  —  1)  sin  a(/c,  s,  j,  \k  —  1)  (20) 


here  vi(k\k  —  1),  v^(k\k  —  1)  are  the  components  of  predicted  velocity  and  a(k ,  s,  j,  | A:  —  1)  is  the  predicted  azimuth 
angle  for  target  j. 

The  variance  term  o+  (k,  s,  j\k  —  l)2  in  (19)  is  the  range  rate  measurement  prediction  variance  which  is  the  third 
diagonal  term  of  the  innovation  covariance  matrix 

S(k,sJ)  =  H(k,sJ)Pj(k\k^l)H(kisjY  +  R(k,sJ)  (21) 

where  in  H(k,  s,j )  the  measured  azimuth  angle  a(Ar,  s,j)  is  replaced  by  the  predicted  azimuth  angle  a(k,  s,j\k  —  1). 

Hence  Kp(s,j)  in  (19)  is  evaluated  by  integrating  a  Gaussian  density  with  mean  r(k,  s,j\k  —  1)  and  variance 
cr+(k,  s,j\k  -  l)2  in  the  interval  -rmjn  to  rmjn  and  subtracting  the  result  from  unity. 

Figure  1  shows  a  notional  survival  probability  and  detection  probability  vs.  range  between  the  target  and  the 
sensor.  It  is  important  to  note  that  in  a  real  life  scenario  the  survival  and  detection  probability  depend  on  the  terrain 
topography. 


3.  SEARCH  TECHNIQUES 

In  this  work  we  have  used  a  genetic  algorithm  to  find  the  global  maximum  of  the  objective  function.  This  is  a 
randomized  technique  suitable  for  problems  with  multiple  maxima.  To  apply  this  algorithm  in  the  current  problem, 
the  positions  of  sensors  are  discretized  to  1  meter  resolution  and  converted  to  a  bit  string.  A  population  size  of  100 
is  used  in  this  application.  First  generation  is  obtained  randomly  and  next  generations  are  obtained  by  a  crossover 
operation,  which  is  applied  separately  on  the  x  and  y  positions  of  each  sensor.  The  parents  are  selected  randomly 
depending  on  their  fitness,  which  is  the  value  of  the  objective  function  corresponding  to  the  sensor  positions  indicated 
by  the  parent.  Along  with  the  crossover  operation,  the  bits  of  the  candidates  in  the  next  generation  can  also  change 
due  to  mutation  operation.  In  this  application  the  mutation  probability  of  each  bit  is  1/30.  Elitism  was  used  to 
preserve  best  parents  in  the  next  generation.  Although  its  rate  of  convergence  slows  down  close  to  the  maxima,  this 
algorithm  eventually  reaches  the  global  maximum  without  being  trapped  by  the  local  ones. 


4.  RESULTS 

In  this  section  we  present  the  results  obtained  on  two  scenarios  using  the  search  technique  discussed  in  Section  3  for 
the  objective  function  discussed  in  Section  2.  Both  scenarios  consist  of  two  sets  having  three  targets  each  and  one  set 
having  four  targets.  In  the  first  scenario  the  target  sets  form  a  triangular  shape  (Figure  2)  and  in  the  second  one  the 
targets  form  a  linear  shape  (Figure  6).  Three  types  of  target  velocities  are  considered,  the  components  of  velocity  are 
[0,  —10]  m/s,  [10,  0]m/s  and  [5y/2,  —  5\/2]m/s  and  will  be  referred  as  type  1,  type  2  and  type  3,  respectively.  In  all 
scenarios  the  height  of  the  sensors  above  the  ground  is  considered  to  be  1  km  and  the  ground  is  considered  to  be  flat. 
The  measurement  noise  standard  deviations  are  ar  =■  5  m,  a$  =  1°  and  —  lm/s.  The  minimum  detectable  range 
rate  fm jn  is  taken  to  be  2  m/s  for  this  simulation.  To  evaluate  the  objective  function  a  state  prediction  covariance 
matrix  P3(k\k  —  1)  is  required  for  each  target  j.  The  steady  state  value  of  P3(k\k  —  1)  can  be  used,  which  is  obtained 
using  the  information  filter  form  of  Riccati  equation  given  by 

Pjil  +  III)"1  =  Aj(l)  -  Aj(l)  (Aj(l)  +  Q(0-1)_1  Aj(l)  (22) 


and 

Aj(i)  =  (fv)-1)' wi \i-ir1 

+  y,  H&sjymsjr'H^sjvFV)-1 

stCjil) 


(23) 


where  C3(l)  is  the  set  of  sensors  that  detects  target  j  at  time  ti  and  F(l)  is  the  transition  matrix  between  time  ti  to 
ti+ 1,  which  depends  on  the  revisit  time  A/  =  ti+\  —  is  given  by 

1  A*  0  0 

0  10  0 
0  0  1  A* 

0  0  0  1 


Fj(l)  = 


(24) 


The  process  noise  matrix  Q(l)  is  obtained  following  a  discrete  white  noise  acceleration  model 


<3(0  = 


±A?  0  0 

l  A?  A?  0  0 

0  0  tA/  ±A? 

0  0  |a?  A2 


(25) 


where  a2  is  the  variance  of  the  white  noise  acceleration  process.  In  our  simulations  <JV  =  0.5  m/s2.  One  important 
property  of  the  information  form  of  Riccati  equation  is  that  the  information  from  different  sensors,  if  they  are 
independent,  can  be  written  in  the  summation  form  as  in  (23). 


Figure  2:  Scenario  for  the  calculation  of  the  steady  state  prediction  covariance  matrix. 


The  steady  state  covariance  matrix  is  obtained  using  the  scenario  shown  in  Figure  2.  For  simplicity  it  is  assumed 
that  the  targets  move  in  the  same  direction  as  the  sensors  and  their  relative  position  remains  the  same  over  time. 
The  revisit  interval  is  assumed  to  be  A /  =  10  sec  for  all  £/.  The  resulting  steady  state  prediction  covariance  matrices 
are  different  for  the  different  targets.  It  is  also  observed  that  the  correlations  between  the  x  component  of  position  or 
velocity  and  the  y  component  of  position  or  velocity  vary  and  can  have  both  positive  and  negative  values.  However, 
correlations  between  the  position  and  velocity  components  of  any  particular  direction  (x  or  y)  are  very  close  to  1/3. 
The  following  steady  state  prediction  covariance  matrix,  which  shows  an  average  behavior,  was  selected  for  each 
target 


Pi  = 
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(26) 


We  also  evaluate  the  effect  of  a  state  prediction  covariance  matrix,  that  is  far  from  the  steady  state,  on  the  optimal 
sensor  positions.  The  following  matrix  is  used  for  this  purpose 


P2  = 


1002 

500 

3 
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500 

5*2 
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0 

0 
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0 
0 

500 

500  t-*2 


(27) 


Figures  3-5  show  the  optimal  positions  of  three  sensors  for  the  first  target  scenario  and  a  low  predicted  covariance 
Pi  in  (26).  It  can  be  seen  that  the  optimal  sensor  positions  depend  on  the  headings  of  the  targets.  Also,  the  closest  two 
sensors  form  a  close  to  right  angle  triangle  formation  at  each  target  so  that  they  obtain  complimentary  information 
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Optimal  position  of  three  sensors  for  target  scenario  1  with  a  low  state  prediction  covariance  and  velocity 
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Figure  4:  Optimal  position  of  three  sensors  for  target  scenario  1  with  a  low  state  prediction  covariance  and  velocity 
type  2. 


about  the  target.  There  is  also  a  tendency  for  the  sensors  to  concentrate  on  the  group  of  four  targets  as  this  increases 
the  obtainable  information. 

Figures  6-8  show  the  similar  nature  of  the  optimal  positions  of  three  sensors  for  the  second  target  scenario. 
However,  in  each  of  these  cases,  due  to  symmetry,  a  mirror  image  of  sensors  w.r.t.  the  targets  along  the  x  axis 
produces  another  set  of  optimal  sensor  positions.  In  Figure  8  the  sensors  abandon  the  three  targets  at  the  bottom 
and  get  the  information  about  the  rest  of  the  targets. 

Figure  9  shows  optimal  positions  of  three  sensors  with  targets  from  the  second  scenario  and  each  of  them  having 
velocity  type  3.  The  difference  from  Figure  8  is  that  in  this  case  the  targets  have  a  high  prediction  covariance  P2  as 
shown  in  (27).  Because  of  this  high  prediction  uncertainty  the  sensors  are  placed  more  uniformly  from  the  targets 
compared  to  those  in  Figure  8. 

Finally,  Figures  10  and  11  show  the  optimal  positions  of  two  sensors  for  the  targets  of  scenario  1  moving  along 
the  negative  y  direction.  A  low  prediction  covariance  matrix  Pi  is  used  for  the  first  figure,  while  a  high  prediction 
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Figure  5:  Optimal  position  of  three  sensors  for  target  scenario  1  with  a  low  state  prediction  covariance  and  velocity 
type  3. 
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Figure  6  Optimal  position  of  three  sensors  for  target  scenario  2  with  a  low  state  prediction  covariance  and  velocity 
type  1. 


covariance  matrix  P2  is  used  in  the  second  case.  Similar  to  the  previous  observations,  the  sensors  in  the  second  case 
place  themselves  more  uniformly  w.r.t.  the  targets. 

5.  CONCLUSIONS 

In  this  paper  a  Fisher  information  based  approach  for  optimal  GMTI  sensor  placement  is  presented.  The  sensors, 
which  measure  targets’  position  along  with  its  radial  velocity,  are  on  UAVs  that  are  tracking  a  number  of  ground 
targets.  The  target  detection  probability  and  sensor  survival  probability  are  considered  while  developing  the  objective 
function.  The  target  detection  probability  depends  on  the  target  range  as  well  as  range  rate.  A  genetic  algorithm 
based  search  techniques  is  used  to  find  the  optimal  sensor  locations.  Results  are  obtained  for  two  scenarios  each 
having  10  targets  and  for  different  number  of  sensors.  Also,  different  state  prediction  covariances  and  velocity 
directions  are  used  to  observe  their  effect.  The  overall  track  estimation  accuracy  should  increase  significantly  with 
this  approach.  The  extension  of  this  work  to  a  dynamic  scenario  is  under  investigation. 
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Figure  7:  Optimal  position  of  three  sensors  for  target  scenario  2  with  a  low  state  prediction  covariance  and  velocity 
type  2. 


Figure  8:  Optimal  position  of  three  sensors  for  target  scenario  2  with  a  low  state  prediction  covariance  and  velocity 
type  3. 
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Abstract — With  the  recent  advent  of  moderate-cost  unmanned 
(or  uninhabited)  aerial  vehicles  (UAV)  and  their  success  in 
surveillance,  it  is  natural  to  consider  the  cooperative  manage¬ 
ment  of  groups  of  UAVs.  The  problem  considered  in  this 
paper  is  the  optimization  of  the  information  obtained  by  a 
group  of  UAVs  carrying  out  surveillance  of  several  ground 
targets  distributed  over  a  large  area.  The  UAVs  are  assumed 
to  be  equipped  with  Ground  Moving  Target  Indicator  (GMTI) 
radars,  which  measure  the  locations  of  moving  ground  tar¬ 
gets  as  well  as  their  radial  velocities  (Doppler).  In  this  pa¬ 
per,  a  cooperative  control  algorithm  is  proposed,  according  to 
which  each  UAV  decides  its  path  independently  based  on  an 
information  theoretic  criterion  function.  The  criterion  func¬ 
tion  also  incorporates  target  detection  probability  and  survival 
probability  for  sensors  corresponding  to  hostile  fire  by  targets 
as  well  as  collision  with  other  UAVs.  The  control  algorithm 
requires  limited  communication  and  modest  computation. 
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1.  Introduction 

An  important  application  of  control  theory  is  to  manage  mul¬ 
tiple  sensors  such  that  the  expected  information  obtained 
from  them  is  maximized.  Management  of  multiple  sen¬ 
sors  involves  gathering,  exchanging  and  fusing  information. 
With  the  recent  advent  of  affordable  unmanned  aerial  vehi¬ 
cles  (UAV)  and  their  proven  effectiveness  in  surveillance,  it 
is  natural  to  consider  the  cooperative  management  of  groups 
of  UAVs. 

A  number  of  UAV  management  algorithms  can  be  found  in 
the  literature.  In  [3]  a  hierarchical  approach,  which  uses  a 
modified  Voronoi  diagram  to  generate  possible  paths  and  to 
intercept  a  number  of  known  targets  using  a  number  of  UAVs 
in  the  presence  of  dynamic  threats,  is  presented.  Similar  ap¬ 
proaches  can  be  found  in  [8],  [9].  A  search  algorithm  for 
targets  in  a  given  area  is  proposed  in  [6],  where  a  discrete 
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time  stochastic  decision  model  is  formulated  as  the  path  plan¬ 
ning  problem,  which  is  then  implemented  with  a  dynamic 
programming  algorithm  [11].  However,  the  aim  of  [6]  is 
only  to  detect  (not  to  track)  the  targets  in  the  search  region. 
In  [5]  a  decentralized  sensor  management  algorithm  is  pre¬ 
sented  based  on  maximizing  information  gain.  In  this  pa¬ 
per,  a  online  bargaining  arrangement  between  the  sensors  was 
used  to  decide  assignment  of  known  targets  in  two  different 
sensor  management  problems.  The  first  one  addresses  a  tar¬ 
get  cueing  and  hand-off  problem  related  to  static  sensors  and 
dynamic  targets,  while  the  second  problem  is  to  generate  a 
coordinated  sensor-platform  trajectory  to  obtain  information 
about  static  features  (targets).  Another  algorithm,  presented 
in  [10],  uses  a  coordination  framework  of  virtual  bodies  and 
artificial  potentials  for  cooperative  control  of  mobile  sensor 
networks.  This  algorithm  focuses  on  “gradient  climbing  mis¬ 
sions  in  which  the  mobile  sensor  network  seeks  out  local 
maxima  or  minima  in  the  environmental  field”. 

In  this  paper,  a  cooperative  control  algorithm  is  developed  for 
a  number  of  UAVs,  equipped  with  GMTI  sensors,  tracking 
multiple  ground  targets.  An  information  theoretic  criterion, 
somewhat  similar  to  [5],  is  used  to  select  the  future  path  of  an 
UAV  such  that  the  total  information,  obtainable  by  the  sen¬ 
sors  in  the  UAVs  as  a  group,  corresponding  to  the  detected 
targets,  is  maximized.  The  algorithm  proposed  in  this  paper, 
unlike  the  one  in  [5],  assumes  no  prior  information  about  the 
number  of  targets  or  their  positions.  The  criterion  function 
used  in  this  paper,  which  was  partly  presented  in  [12],  takes 
into  account  the  detection  probabilities  of  targets,  which  are 
based  on  both  range  and  range  rate,  and  survival  probabili¬ 
ties  of  the  sensors  due  to  hostile  fire  from  targets  and  possible 
collision  with  other  UAV s,  in  computing  the  information  for 
a  particular  target-sensor  geometry.  The  simulation  results 
show  that  the  algorithm  enables  a  group  of  UAVs  to  gather 
information  in  a  cooperative  manner  in  the  region  of  interest. 
The  algorithm  can  incorporate  initial  information  if  available; 
however,  it  can  also  work  without  any  initial  knowledge.  This 
adaptive  algorithm  can  account  for  new  targets  as  they  appear. 
The  computational  complexity  and  communication  require¬ 
ments  of  this  algorithm  are  modest  enough  for  the  realtime 
applications  related  to  small  UAVs  with  limited  computing 
power.  Finally,  because  of  its  distributed  approach  the  algo¬ 
rithm  is  robust  and  hence,  it  does  not  fail  in  the  eventuality  of 
the  loss  of  some  UAVs  or  communication  failure. 
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The  paper  is  organized  as  follows.  Development  of  the  ob¬ 
jective  function  is  discussed  in  Section  2.  In  Section  3  the 
cooperative  control  algorithm  is  described  in  detail.  Simula¬ 
tion  results  are  presented  in  Section  4  and  Section  5  presents 
the  concluding  remarks. 

2.  Objective  Function 

The  objective  function,  based  on  the  Fisher  information  mea¬ 
sure  is  the  product  of  uncertainty  volumes  [12] 


In  this  work,  the  UAVs  are  assumed  to  be  equipped  with 
GMTI  sensors  and  the  state  vector  of  target  j  is  taken  as 

X  =  [  X3  v£  yi  v>  ]  (8) 

where  x  and  y  are  the  position  components  in  Cartesian  co¬ 
ordinate  and  vXy  vy  are  the  velocity  components.  The  mea¬ 
surement  vector  comprises  x-y  position  and  radial  velocity  r. 
The  measurement  matrix,  of  target  j  corresponding  to  sensor 
5,  is  given  by 


J  =  ^log|P,(*|fc)-,|  (1) 
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where  Pj(k\k)  is  the  posterior  covariance  matrix  of  the  state 
vector  corresponding  to  target  j  at  time  tk  and  can  be  written 
in  terms  of  the  state  prediction  covariance  P3(k\k—  1)  and  the 
new  information  Yj(k,  s)  obtainable  by  the  sensors  (UAVs)  as 
follows  [1] 

Pj(k\k)~'  =  Pj(k\k  —  l)-1  +  y'-KS(s)-xD(s,j)Yj(k,s) 

S 

(2) 

where  kd{sJ)  is  the  detection  probability  of  target  j  by  UAV 
s.  This  probability  can  be  expressed  as 


H(k,s,  j) 


10  0  0 
0  0  10 
0  cosa(Ar,s,j)  0  sin  a(k,s,j) 


(9) 

where  a(/c,  s,  j)  is  the  azimuth  angle  of  the  target  j  measured 
by  sensor  s  at  time  tk.  The  original  position  measurements 
are  in  the  form  of  range  r(fc,  s,j)  and  azimuth  a(k ,  s,j)  in 
the  presence  of  noise,  which  are  converted  to  xyy  position 
using  the  standard  conversion  [1].  The  original  position  mea¬ 
surements  in  range,  azimuth  angle  and  range  rate  contains 
independent  additive  Gaussian  noise  and  the  corresponding 
noise  variances  are  given  by  cr2,  a2  and  cr2,  respectively.  Af¬ 
ter  conversion,  the  measurement  covariance  matrix  R(k,  s,  j) 
is  given  by 


^D(s,j)  =  n1D(s,j)ir%(sJ)  (3) 

where  ^(s,  j)  is  the  detection  probability  as  a  function  of 
the  range  and  n2D(s,j)  is  the  detection  probability  factor  as  a 
function  of  the  range  rate.  The  nature  of  the  first  term  is  dis¬ 
cussed  in  Section  4.  The  second  term  is  the  probability  that 
the  range  rate  measured  by  the  GMTI  sensor  is  higher  than  the 
“minimum  detectable  velocity”  (MDV)  and  the  correspond¬ 
ing  computation  is  available  in  [1 2]  The  term  7rs(s)  in  (2)  is 
the  total  survival  probability  of  UAV  s,  which  is  equal  to  the 
product  of  target-fire  survival  probability  71-5  (s)  and  collision 
survival  probability  7r|(s)  of  this  UAV,  i.e., 

*s{s)  =  7is(s)7r|(s)  (4) 

where  715(5),  in  turn,  is  the  product  of  target-fire  survival 
probabilities  of  UAV  s  in  view  of  each  target,  i.e., 

(5) 
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and  7r|(s)  is  the  product  of  collision  survival  probabilities 
corresponding  to  all  other  UAVs 

*!(«)  =  n  *1(3,  i)  (6) 

i,i^s 

The  information  Yj(k,  s)  obtained  by  a  particular  UAV  s 
about  target  j  is  given  by 

Yj{k,S)  =  H{k,  s,j)'R(k,  sJ)~lH(k,  s,j)  (7) 

where  H(kys,j)  is  the  measurement  matrix  and  R(k,  s,j) 
is  the  measurement  noise  covariance  matrix  corresponding  to 
the  sensor-target  pair  5,  j. 


R(k,s,j) 


Ri,\  R  1,2  0 

R\,2  Ri,2  0 

0  0  a, ? 


where  (skipping  the  arguments) 

OO  OO  O 

R\yi  =  r  <rasina  +<rrcosa 
#2,2  =  r2a£  cos  a2  4-  a2  sin  a2 

R\%2  =  (<r2  —  r2cr2)  sin  a  cos  a 


GO) 


(ID 

(12) 

(13) 


As  shown  in  [13]  one  can  use  in  (9)  the  observed  azimuth. 

3.  Algorithm 

In  this  section,  the  algorithm  followed  by  each  UAV,  which 
results  in  cooperative  control  of  the  UAVs  as  a  group,  is  dis¬ 
cussed  in  detail.  Each  UAV  performs  the  tasks  shown  in  Fig¬ 
ure  1  asynchronously  w.r.t.  the  other  UAV s.  Each  UAV  scans 
its  environment  using  a  GMTI  sensor  at  an  interval  of  T.  The 
detections,  along  with  this  UAV’s  current  position  and  veloc¬ 
ity,  are  then  transmitted  to  the  other  UAVs.  There  is  a  nc 
probability  that  another  UAV  will  receive  this  transmission. 
Each  UAV  maintains  its  own  set  of  target  tracks,  which  are 
updated  when  a  new  set  of  detections  is  either  obtained  by 
this  UAV  or  received  from  another  UAV. 

After  scanning  for  measurements,  transmitting  them  and  up¬ 
dating  the  target  tracks,  each  UAV  determines  its  path  for  the 
next  interval  T.  For  a  coordinated  operation  this  decision  de¬ 
pends  on  the  corresponding  UAV’s  knowledge  about  the  cur¬ 
rent  locations  of  the  other  UAVs  and  the  current  state  of  the 
target  tracks  maintained  by  it.  The  criterion  function  J  in  (1) 
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in  each  group  is  70  m.  The  target  groups  move  in  different 
directions. 


Yes 


Figure  1 .  Flow  chart  of  the  tasks  of  each  UAV. 


is  maximized  to  obtain  the  paths  of  the  UAVs.  The  knowledge 
of  the  state  of  another  UAV  is  updated  when  information  is  re¬ 
ceived  from  that  UAV.  In  case  of  a  failure  in  communication, 
the  path  decided  for  the  particular  UAV  in  the  last  iteration  is 
assumed  to  be  its  actual  path.  If  no  information  is  received 
from  a  particular  UAV  for  a  number  of  times,  the  correspond¬ 
ing  UAV  is  considered  to  be  lost. 


The  number  of  UAVs  considered  for  different  cases  is  from 
2  to  4  and  each  of  them  starts  at  y  position  of  - 12  km  while 
keeping  a  distance  of  1  km  from  the  closest  ones  along  the 
x  direction.  Initially,  the  UAVs  move  at  a  rate  of  30  m/s 
along  the  -fy  direction.  The  UAVs  start  with  no  knowledge 
about  the  targets  and  continue  in  the  +y  direction  until  target 
tracks  are  formed.  Once  target  measurements  are  obtained 
and  tracks  are  formed,  each  UAV  decides  its  path  by  maxi¬ 
mizing  J  in  (1)  based  on  its  knowledge  of  the  positions  of  the 
other  UAVs  and  target  tracks  maintained  by  the  correspond¬ 
ing  UAV.  Each  UAV  repetitively  performs  a  set  of  operations 
as  discussed  in  Section  3.  For  this  simulation,  T  is  5  s,  which 
means  each  UAV  performs  the  set  of  tasks  shown  in  Figure  1 
within  that  time.  Also,  the  success  probability  of  a  communi¬ 
cation  between  two  UAVs,  which  is  denoted  by  nc  in  Section 
3,  is  0.9.  In  this  simulation  one  point  track  initialization  [13] 
is  applied  and  track  maintenance  is  performed  by  a  two  stage 
procedure:  measurement  to  track  association,  which  is  per¬ 
formed  by  the  auction  algorithm  [4],  and  track  update  using 
a  Kalman  filter.  A  white  noise  acceleration  model  is  assumed 
for  the  targets  with  process  noise  standard  deviation  (s.d.) 
being  1  m/s2.  The  measurement  noise  s.d.  are  ar  =  5  m, 
(7$  =  10_3rad  and  oy  =  lm/s.  The  probability  of  false 
alarm  in  this  simulation  is  10~6  (in  a  resolution  cell  of  size 
20  m  x  4  mrad  x  4  m/s). 


Commonly  for  the  tracking  algorithms  presented  in  the  lit¬ 
erature,  a  track  is  deleted  if  it  is  not  associated  with  mea¬ 
surements  for  more  than  a  predetermined  number  of  updates. 
However,  this  rule  is  not  based  on  the  observability  criterion 
and  may  result  in  deletion  of  tracks  because  they  are  unob¬ 
servable  by  the  sensors  used.  To  avoid  this,  in  this  work,  a 
quantity  71^^  is  updated  each  time  a  set  of  detections  is  re¬ 
ceived,  as  follows 


"track** +  !)={  7rtrack(fc)1(1  *d) 


if  not  associated 
otherwise 


(14) 

where  tvq  is  the  probability  of  detection  of  the  particular 
track.  If  7^^  falls  below  a  predetermined  fraction,  10“3 
in  this  simulation,  the  track  is  deleted  from  the  track  list. 


After  setting  its  course  for  next  interval  T  each  UAV  then 
waits  for  any  transmission  from  the  other  UAVs  as  shown 
in  Figure  1 .  If  a  new  set  of  detections  is  received  from  an¬ 
other  UAV,  then  the  target  tracks,  maintained  by  this  UAV, 
are  updated.  After  T  seconds  the  UAV  once  again  scans  its 
environment  with  its  GMT1  sensor  and  so  on. 

4.  Results 

In  this  section,  we  present  the  simulation  results  obtained  for 
a  scenario  which  has  10  targets  moving  in  three  groups  as 
shown  in  Figure  4.  The  first  group  has  4  targets  and  the  other 
two  have  3  targets  each.  The  separation  between  the  targets 


In  this  simulation,  the  altitude  of  the  UAVs  above  the  ground 
is  considered  to  be  1  km  and  the  ground  is  considered  to  be 
flat.  The  UAVs  fly  at  a  constant  speed  of  30  m/s  and  can  per¬ 
form  coordinated  turns  with  angular  turn  rate  upto  3°/s.  To  set 
its  path  for  the  next  period,  each  UAV  decides  its  angular  him 
rate  by  maximizing  J  in  (1 ).  Since  J  also  includes  the  angular 
turn  rates  of  the  other  UAVs,  these  quantities  are  also  deter¬ 
mined  in  a  joint  maximization  procedure.  Matlab  function 
’ftnincon’  is  used  to  perform  this  constrained  optimization. 
The  angular  turn  rates  of  the  other  UAVs  are  not  transmitted 
as  each  UAV  performs  this  operation  independently. 

Figure  2  shows  the  survival  probabilities  and  detection  proba- 
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Figure  2  Detection  and  survival  probabilities  assumed  in 
this  simulation. 


bility  as  a  function  of  range,  as  used  in  this  simulation.  It  can 
be  seen  that  the  survival  probabilities  increase  as  the  target¬ 
sensor  distance  and  sensor-sensor  distance  increase.  How¬ 
ever,  the  detection  probability  w.r.t.  range  increases  as  the  dis¬ 
tance  between  target- sensor  decreases.  It  is  important  to  note 
that  in  a  real  life  scenario  the  survival  and  detection  probabil¬ 
ities  depend  on  the  terrain  topography  and  the  algorithm  pro¬ 
posed  in  this  work  is  applicable  for  any  assumption  on  these 
probabilities.  The  detection  probability  factor  as  a  function 
of  the  range  rate  of  a  target  w.r.t.  a  sensor  is  a  step  function 
being  1  if  the  target  range  rate  magnitude  is  more  that  2  m/s 
and  0  otherwise. 


Figure  3.  Contour  plot  of  criterion  function  w.r.t.  different 
position  of  a  fourth  UAV. 


different  positions  of  the  fourth  UAV  in  a  scenario  that  has  3 
UAVs  tracking  10  targets.  The  criterion  function  drops  near 
the  targets  or  the  sensors  forming  a  barrier.  The  reason  of 
this  behavior  is  the  very  low  survival  probability  of  the  fourth 
UAV  close  to  the  targets  or  other  sensors.  Farther  from  the  tar¬ 
gets,  there  are  local  minima  in  the  direction  of  target  motion 
The  global  minimum  is  near  the  center  of  the  plot  between  3 
targets  groups. 


Figure  4.  A  typical  set  of  paths  for  2  UAVs  when  tracking 
10  targets. 


Figure  5.  20  run  position  RMSE  for  tracks  maintained 
separately  by  2  UAV s. 


Figure  4  shows  a  typical  set  of  paths  of  UAVs  w  hen  only  two 
of  them  are  present.  It  can  be  seen  that  UAV  1  tracks  target 
group  1  and  UAV  2  tracks  target  groups  2  and  3*.  The  par¬ 
ticular  attention  to  group  1  can  be  explained  by  the  fact  that 


Figure  3  shows  the  contour  plot  of  the  criterion  function  for 
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1  None  of  the  UAVs  arc  assigned  to  a  particular  target  group.  However,  they 
can  move  to  a  particular  target  group  if  that  maximizes  the  total  information 
and  in  doing  so  the  other  groups  go  out  of  its  detectable  range. 


Figure  6.  20  run  velocity  RMSE  for  tracks  maintained 
separately  by  2  UAVs. 


this  group  has  4  targets,  one  more  than  the  other  groups.  Fig¬ 
ures  5  and  6  show  a  20  run  position  and  velocity  RMSE  of 
the  tracks  maintained  separately  by  the  two  UAVs.  Since  the 
UAVs  start  close  to  target  Group  2,  these  targets  are  detected 
first  and  the  RMSEs  of  the  corresponding  tracks  decrease  as 
the  UAVs  start  moving  in  their  direction.  However,  once  other 
targets  are  detected,  the  UAVs  start  moving  towards  them  and 
the  RMSEs  of  the  tracks  corresponding  to  target  group  2  in¬ 
crease  and  that  for  the  other  groups  decrease.  At  the  end  of 
the  simulations,  the  tracks  corresponding  to  target  group  1 
have  minimum  position  and  velocity  RMSE. 


Figure  7.  A  typical  set  of  paths  for  3  UAVs  when  tracking 
10  targets. 


Figure  7  shows  a  typical  set  of  paths  of  3  UAVs  when  track¬ 
ing  the  ten  targets  of  the  target  scenario.  In  this  case,  UAV  2 
tracks  target  group  1  and  UAV  1  tracks  target  groups  2  and  3, 


Figure  8.  20  run  position  RMSE  for  tracks  maintained 
separately  by  3  UAVs. 


Figure  9.  20  run  velocity  RMSE  for  tracks  maintained 
separately  by  3  UAVs. 


while  UAV  3  pays  attention  to  all  three  target  groups.  Figures 
8  and  9  show  a  20  run  track  position  and  velocity  RMSE  when 
the  targets  are  tracked  by  3  UAVs.  Once  again,  target  group  2 
is  detected  first  and  the  corresponding  tracks  have  lower  po¬ 
sition  and  velocity  RMSE  in  the  first  part  of  each  simulation. 
However,  once  the  other  targets  are  detected,  the  UAVs  start 
moving  towards  them.  Finally,  the  tracks  from  target  group 
1  have  the  highest  position  accuracy  and  all  tracks  have  ve¬ 
locity  error  of  the  similar  magnitude.  The  overall  RMSE  of 
these  tracks  are  considerably  smaller  than  those  of  the  tracks 
formed  by  two  UAVs. 

Figure  10  shows  a  typical  set  of  paths  of  4  UAVs  when  track¬ 
ing  the  ten  targets  of  the  scenario.  In  this  case,  UAV  1 , 2  and  4 
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Figure  10  A  typical  set  of  paths  for  4  UAVs  when  tracking 
1 0  targets. 


Figure  11.  20  run  position  RMSE  for  tracks  maintained 
separately  by  4  UAVs. 


tracks  one  group  each  and  UAV  3  stays  in  between  and  looks 
over  all  groups.  Figures  1 1  and  1 2  show  a  20  run  position  and 
velocity  RMSE  of  the  tracks  corresponding  to  the  ten  targets. 
In  this  case,  both  of  the  RMSEs  are  even  smaller.  These  fig¬ 
ures  show  that  at  the  final  stage  of  the  runs  target  set  3  gets 
less  attention  from  the  UAVs  as  they  concentrate  their  efforts 
on  the  tracks  corresponding  to  the  other  two  target  groups 
which  are  close  to  each  other. 

Figure  13  shows  the  path  of  the  targets  and  3  UAVs  for  the 
first  25  minutes  of  a  typical  simulation.  In  this  case,  UAV  I 
is  tracking  target  groups  2  and  3,  UAV  2  is  tracking  all  target 
groups  and  UAV  3  is  tracking  target  group  I .  UAV  3  is  lost  at 
25  minutes  and  Figure  1 4,  which  is  the  continuation  of  Figure 


Figure  12.  20  run  velocity  RMSE  for  tracks  maintained 
*  separately  by  4  UAV s. 


Figure  13.  A  typical  set  of  paths  for  3  UAVs  before  UAV  3 
is  lost  when  tracking  10  targets. 


13,  shows  the  path  of  the  remaining  UAVs  and  targets  after 
UAV  3  is  lost.  The  end  of  the  UAV  paths  in  Figure  13  and 
the  beginning  of  the  paths  in  Figure  14  are  marked  by  small 
circles.  From  Figure  14  it  can  be  seen  that  in  the  absence  of 
UAV  3,  UAV  2  moves  towards  target  group  I  and  it  simul¬ 
taneously  tracks  target  group  2,  while  UAV  I  keeps  tracking 
target  groups  2  and  3.  Figures  15  and  16  show  the  position 
and  velocity  RMSE  of  the  tracks  maintained  separately  by 
the  UAVs.  It  can  be  seen  that  the  position  and  velocity  RMSE 
increase  after  UAV  3  is  lost. 

Figure  1 7  shows  a  typical  path  of  3  UAV s  when  the  target-fire 
survival  probabilities  of  the  UAVs  are  not  considered  in  the 
criterion  function.  In  this  simulation,  the  minimum  distance 
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rigure  14  A  typical  set  of  paths  for  2  UAVs  when  tracking 


10  targets  after  an  UAV  is  lost,  (time  0-25  min) 


Figure  16.  Velocity  RMSEs  for  targets  when  tracked  by  3 
UAV s  and  one  of  the  UAV s  is  lost. 


between  a  target  and  an  UAV  is  less  than  3  m,  which  may 
be  fatal  if  the  targets  are  hostile.  This  shows  the  importance 
of  considering  target-fire  survival  probability  for  a  practical 
use  of  information  theoretic  approach.  When  this  probability 
is  considered  the  minimum  value  of  target-sensor  distance  is 
always  more  than  1 000  m. 

Usually,  the  UAVs  do  not  come  close  to  each  other,  as  such 
a  configuration  is  not  favorable  for  maximizing  information. 
However,  in  certain  situations,  UAVs  can  cross  each  other’s 
path  and  while  doing  so  they  might  come  very  close,  resulting 
in  a  collision.  In  this  work,  this  possibility  is  avoided  by  using 
the  collision  survival  probability  in  the  criterion  function  in 
(1). 


5.  Conclusions 

In  this  paper,  a  novel  cooperative  control  algorithm,  for  a 
number  of  UAVs  tracking  multiple  targets,  is  presented.  An 
information  theoretic  approach  is  used  for  the  path  selection 
by  the  UAVs,  which  incorporates  target  detection  probability 
and  UAV  survival  probabilities  due  to  hostile  fire  by  targets 
and,  also,  due  to  collision  with  other  UAVs. 

A  scenario  of  10  targets,  which  moves  in  three  groups,  is  sim¬ 
ulated  and  2-4  UAVs  were  deployed  to  track  them,  in  differ¬ 
ent  cases.  The  results  show  that  the  cooperative  control  al¬ 
gorithm  enables  the  UAVs  to  maintain  almost  similar  average 
accuracy  for  different  tracks  and,  also,  the  position  and  ve- 
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locity  RMSE  for  all  tracks  decrease  as  the  number  of  UAVs 
increase.  The  robustness  of  this  algorithm  in  the  face  of  an 
UAV  loss  and  importance  of  different  probabilities  in  the  cri¬ 
terion  function  are  also  demonstrated. 

Currently,  a  joint  maximization  routine  is  used  to  find  out  the 
paths  of  the  UAVs.  This  can  be  computationally  expensive 
if  the  number  of  targets  and  UAVs  increase.  For  this  reason, 
a  prospect  of  using  gradient  directions  as  the  directions  of 
motions  of  the  UAVs  is  being  investigated.  Also,  imposing 
more  constraints  on  sensors’  ability  to  observe  will  be  another 
future  direction  of  our  work. 
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ABSTRACT 

With  the  recent  advent  of  moderate-cost  unmanned  (or  uninhabited)  aerial  vehicles  (UAV)  and  their  success  in 
surveillance,  it  is  natural  to  consider  the  cooperative  management  of  groups  of  UAVs.  The  problem  considered 
in  this  paper  is  the  optimization  of  the  information  obtained  by  a  group  of  UAVs  carrying  out  surveillance 
search  and  tracking  —  over  a  large  region  which  includes  a  number  of  targets.  The  goal  is  to  track  detected 
targets  as  well  as  search  for  the  undetected  ones.  The  UAVs  are  assumed  to  be  equipped  with  Ground  Moving 
Target  Indicator  (GMTI)  radars,  which  measure  the  locations  of  moving  ground  targets  as  well  as  their  radial 
velocities  (Doppler).  In  this  paper,  a  decentralized  cooperative  control  algorithm  is  proposed,  according  to  which 
the  UAVs  exchange  current  scan  and  detection  information  and  each  UAV  decides  its  path  separately  based  on 
an  information  based  objective  function  that  incorporates  target  state  information  as  well  as  target  detection 
probability  and  survival  probability  for  sensors  corresponding  to  hostile  fire  by  targets  and  collision  with  other 
UAVs.  The  proposed  algorithm  requires  limited  communication  and  modest  computation  and  it  can  handle 
failure  in  communication  and  loss  of  UAVs. 

Keywords:  Multisensor-multi  target  tracking,  sensor  management,  cooperative  control,  UAV  placement,  ground 
target  tracking. 


1.  INTRODUCTION 

An  important  application  of  control  theory  is  to  manage  multiple  sensors  such  that  the  information  obtained 
from  the  surveillance  region  is  maximized.  Management  of  multiple  sensors  involves  gathering,  exchanging 
and  fusing  information.  With  the  recent  advent  of  affordable  unmanned  aerial  vehicles  (UAV),  a  considerable 
amount  of  research  effort  has  been  directed  toward  mobile  sensor  management,  wrhich  also  includes  path  planning. 
The  advantages  of  UAVs  include  removal  of  the  risk  to  human  operators,  lower  cost,  smaller  size/weight,  greater 
maneuverability  and  possibility  of  effective  coordination.  For  these  reasons,  in  future,  UAVs  would  be  extensively 
used  in  surveillance,  search /rescue,  communication  and  other  military  and  civilian  applications. 

A  number  of  UAV  management  algorithms  can  be  found  in  the  literature.  In  [13]  a  cooperative  control  algo¬ 
rithm  for  multiple  UAVs  to  simultaneously  reach  a  predetermined  target  location,  which  maximizes  survivability 
of  the  UAVs  due  to  exposure  to  threats  while  adhering  to  fuel  constraint,  is  addressed.  A  hierarchical  decision 
mechanism  is  proposed  in  which  at  team  level  the  estimated  time  until  arrival  is  computed  and  at  UAV  level 
path  planning  is  performed.  In  [3]  a  similar  approach,  which  include  Voronoi  diagram  in  path  planning,  is  used 
for  the  simultaneous  intercept  problem  in  the  presence  of  dynamic  threats.  Similar  approaches  can  be  found  in 
[6,  11,  14].  In  [9]  another  hybrid  control  structure  is  proposed  for  the  simultaneous  intercept  problem.  Here, 
UAV-to-target  allocation  and  time  to  reach  the  targets  are  decided  in  the  central  node,  while  the  path  decisions 
are  taken  locally  in  UAVs. 

In  [15]  a  multi- vehicle  path  planning  problem  in  hostile  environment  is  solved  by  dynamic  programming. 
In  this  centralized  algorithm  path  decisions  are  taken  in  terms  of  connecting  pre-defined  way-points.  Ip  [16]  a 
decentralized  cooperative  search  algorithm  is  proposed  in  which  UAVs  exchange  environment  information  but 
independently  decides  their  paths  by  minimizing  a  convex  combination  of  costs  associated  with  subgoals.  It  can 
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be  noted  that  for  UAV  management  problems  decentralized  algorithms  is  preferred  as  they  have  advantages  of 
graceful  degradation,  scalability  and  modularity  properties  over  the  centralized  ones.  In  [8],  for  a  search  problem, 
dynamic  programming  is  used  to  make  finite  horizon  decisions.  Note  that  our  problem  includes  search  as  well 
as  tracking. 

In  [7]  a  decentralized  sensor  management  algorithm,  based  on  maximizing  information  gain,  is  presented.  In 
this  paper,  an  online  negotiation  arrangement  between  the  sensors  is  used  to  decide  the  assignment  of  known 
targets  to  generate  a  coordinated  sensor-platform  trajectory  for  obtaining  information  about  static  features 
(targets).  In  [5]  a  decentralized  multi-UAV  search  algorithm,  for  a  lost  vehicle  in  stationary  state,  is  proposed. 
This  algorithm  is  synchronous,  according  to  which  each  UAV  negotiates  with  others  by  communicating  the 
expected  measurement  likelihood  for  its  best  path  given  the  measurement  likelihood  of  other  UAVs. 

In  our  previous  work  [17],  which  describes  a  UAV  placement  algorithm,  a  preliminary  version  of  the  objective 
function,  which  is  used  for  UAV  path  planning  in  the  current  work,  is  presented.  However,  in  [17]  only  previously 
detected  targets  are  considered  in  the  objective  function  as  the  goal  of  this  algorithm  is  tracking,  not  searching. 
An  extension  of  this  paper  for  the  dynamic  tracking  scenario  is  presented  in  [18].  Here,  too,  searching  for  new 
target  was  not  the  goal.  Also,  the  UAVs  are  considered  to  be  able  to  scan  a  large  region  in  which  the  probability 
of  detection  is  nonzero. 

In  this  paper,  a  decentralized  cooperative  control  algorithm  is  developed  for  a  number  of  UAVs,  equipped 
with  GMTI  sensors,  to  track  multiple  ground  targets  and  search  for  new  ones  in  a  specified  region.  The  region  is 
divided  into  a  number  of  square  sectors1  and  it  is  assumed  that  each  UAV  can  scan  Ns  sectors  each  time.  In  the 
proposed  algorithm  the  UAVs  exchange  the  information  about  the  sectors  scanned,  measurements  obtained  and 
their  current  kinematic  states.  Each  UAV  decides  the  sectors  it  scans  and  the  path  it  follows  on  its  own.  In  this 
work  these  decisions  are  decoupled  into  two  separate  problems.  An  information  based  objective  function  is  used 
to  select  the  future  path  of  a  UAV  such  that  the  total  information,  obtainable  by  the  sensors  in  the  UAVs  as 
a  group,  corresponding  to  the  detected  targets,  is  maximized.  In  addition,  this  objective  function  incorporates 
possible  information  from  the  undetected  targets,  which  are  included  in  the  form  of  possible  targets  at  the  center 
of  the  sectors.  The  objective  function  used  in  this  paper  takes  into  account  the  detection  probabilities  of  targets, 
which  are  based  on  both  range  and  range  rate,  and  the  survival  probabilities  of  the  sensors  due  to  hostile  fire 
from  targets  and  possible  collision  with  other  UAVs,  in  computing  the  information  for  a  particular  target-sensor 
geometry. 

The  algorithm  proposed  in  this  paper  starts  with  no  prior  information  about  the  number  of  targets  or  their 
positions.  Nonetheless,  it  is  possible  to  incorporate  prior  information,  if  available.  The  simulation  results  show 
that  the  algorithm  enables  a  group  of  UAVs  to  gather  information  in  a  cooperative  manner  in  the  region  of  interest 
and  to  detect  new  targets  as  they  appear.  The  computational  complexity  and  communication  requirements  of 
this  algorithm  are  modest  enough  for  the  real  time  applications  related  to  small  UAVs  with  limited  computing 
power.  Finally,  because  of  its  decentralized  approach  the  algorithm  is  robust  and  hence,  it  does  not  fail  in  the 
eventuality  of  loss  of  some  UAVs  or  communication  failure. 

The  paper  is  organized  as  follows.  Development  of  the  objective  function  for  path  planning  is  discussed  in 
Section  2.  In  Section  3  the  cooperative  control  algorithm  is  described  in  detail.  Simulation  results  are  presented 
in  Section  4  and  Section  5  presents  the  concluding  remarks. 

2.  OBJECTIVE  FUNCTION  FOR  PATH  PLANNING 

One  of  the  most  important  stages  in  the  development  of  a  sensor  management  algorithm  is  the  choice  of  objective 
function  which  requires  to  be  easy  to  compute  and  the  decisions  based  on  this  function  should  enable  the  algorithm 
to  achieve  sensor  management  goals.  In  this  work  we  consider  the  following  objective  function 

J(fc)-^(ln|/i(fc|fc)|-hi|/i(*|*-l)|)+  J2  (ln|/m,n(A:)|-ln|/^n|)  (1) 

j  {m,n}€5^ 


*Note  that  it  is  possible  to  incorporate  any  other  shapes  of  sectors  in  this  algorithm. 


where  Ij(k\k  —  1)  and  /j(A;|A;)  are  the  predicted  and  updated  information  matrices  for  a  target  j  at  time  step  k, 
Im,n{k)  is  the  expected  information  matrix  of  new  targets  in  sector  {m,  n}  of  the  surveillance  region  and  Sp  is 
the  set  all  sectors  in  which  there  are  no  currently  tracked  targets;  I^  n  is  the  prior  information  in  sector  {m,  n} 
where  the  uncertainty  is  the  same  as  that  of  a  target  that  can  be  anywhere  in  the  sector  and  move  at  any  possible 
velocity. 

The  information  matrices  in  (1)  are  in  the  Fisher  sense  which  means  they  are  the  inverses  of  the  corresponding 
covariance  matrices.  J  is  the  total  information  gain  related  to  all  targets  under  independence  and  equal  weight 
assumptions  [12].  Also,  each  term  of  J  is  the  difference  between  the  information  after  update  and  before 
update.  The  individual  information  in  (1),  which  is  the  (natural)  logarithm  of  determinant  of  the  corresponding 
information  matrix,  is  similar  to  the  negative  of  Shanon’s  entropy,  ignoring  a  constant,  under  the  assumption 
of  Gaussian  error.  As  entropy  is  the  uncertainty  associated  with  a  random  variable,  it  makes  sense  to  use  the 
negative  of  entropy  as  information.  It  can  be  noted  that  we  deviate  from  the  “negative  of  expected  value  of 
logarithm  of  the  density  function”  definition  to  a  more  convenient  “logarithm  of  the  determinant  of  information 
matrix”  definition  of  information.  Both  definitions  result  in  similar  functions  for  a  Gaussian  density. 

Let  us  consider  that  the  predicted  covariance  of  a  target  is  P  (denoted  without  time  argument,  for  simplicity) 
in  a  linear  Gaussian  system.  Also,  there  is  a  probability  i to  of  this  target  being  detected  at  this  instant.  Hence 
the  updated  covariance  is  given  by 

p  _  (  P  —  PHf(HPHf  +  R)-1  HP  if  the  target  is  detected 
1  P  otherwise 


where  H  and  R  are  the  measurement  matrix  and  the  measurement  covariance  matrix,  respectively.  The  expected 
updated  covariance  matrix  is 

P  =  P  -  7vdPH'{HPH'  -f  R)-lHP  (3) 

where  the  detection  event  is  assumed  to  be  independent  of  the  measurement  or  process  noise.  However,  in  this 
case  the  information  matrix  does  not  have  a  simple  form  similar  to  P^1  -F  Hl R~l H  unless  ivD  takes  one  of  the 
extreme  values.  Since  it  is  much  more  convenient  to  work  with  a  form  in  which  total  matrix  information  is 
summation  of  the  information  from  different  (independent)  sources,  in  this  work  we  use  the  expected  updated 
information  given  by 

J  =  /  +  7T£)Hr  R~*  H  (4) 

where  I  and  I  are  the  predicted  information  and  expected  updated  information,  respectively.  Note  that  the 
expected  updated  covariance  is  given  only  approximately  by  the  inverse  of  the  expected  updated  information. 

In  our  problem  the  UAVs  scan  the  targets  asynchronously  which,  in  general,  leads  to  a  complicated  objective 
function.  However,  if  the  offset  times  are  small,  as  in  our  case,  the  target-sensor  geometry  does  not  change  much 
from  the  scan  by  one  UAV  to  that  by  another  one.  Hence,  for  convenience,  the  objective  function  is  constructed 
assuming  synchronous  scans  by  all  sensors. 

The  detection  probability  of  target  j  by  sensor  s  at  time  step  k  is  written  as 

7 TD(k,  s,j)  =  7 rlD(k,  s,j)ir2D(k,  s,j)  (5) 


where  7t^(A:,  s,  j)  is  the  detection  probability  (of  target  j  by  UAV  s  at  time  k)  as  a  function  of  the  range  and 
^(JkjSj’)  is  the  detection  probability  as  a  function  of  the  range  rate.  The  first  term  depends  on  the  specific 
application  and  the  second  term  is  the  probability  that  the  range  rate  measured  by  the  GMTI  sensor  is  higher 
than  “minimum  detectable  velocity”  (MDV),  discussed  later  in  this  section. 

The  total  survival  probability  of  UAV  s,  which  is  equal  to  the  product  of  target-fire  survival  probability 
7r s(k,s)  and  collision  survival  probability  i r|(fc,  s)  of  this  UAV,  i.e., 

tts(M)  =  fls(fc,s)7r!(fc,s)  (6) 


where  71*5  (fc,  s),  in  turn,  is  the  product  of  target-fire  survival  probabilities  of  UAV  s  in  view  of  each  target,  i.e., 


7Ts(fc,s)  =  T  7Ts(*,  S,j) 


(7) 


(8) 


and  7r|(A:,  5)  is  the  product  of  collision  survival  probabilities  corresponding  to  all  other  UAVs 

TTs{k,s)  =  JJ  nl(k,s,i) 

M/a  * 

The  nature  of  these  survival  probabilities  is  application  dependent. 

The  combined  probability  that  sensor  s  contributes  to  the  information  about  target  j  at  time  k  is  given 
by  7ro{k,  s,  j)ns(k,s).  The  information  obtained  by  a  particular  UAV  s  about  target  j  at  time  k  is  given  by 
H(k,s,j)'R{k,s,j)-lH(k,s,j)  where  H(k,s,j)  is  the  measurement  matrix  and  R(k,s,j)  is  the  measurement 
noise  covariance  matrix  corresponding  to  the  sensor-target  pair  s,j. 

The  approximate  expected  information  matrix  of  target  j ,  following  (4),  due  to  the  scans  by  all  sensors,  is 
given  by 


I3(k\k)  =  Ii{k\k-l)  +  ^iC8(k,s)KD(k,aJ)H(k,s,jYRlk,sJ)-lH{k,s,j)  (9) 

s 

where  Ij(k\k  —  1)  =  Pj(k\k  —  l)"1.  Note  that  in  the  above  equation  the  survival  event,  detection  event  and 
measurement  errors  are  assumed  independent. 

For  the  sectors  that  do  not  contain  any  of  the  currently  tracked  targets,  new  (undetected)  targets  are  the 
possible  source  of  information.  The  prior  information  for  sector  {m,  n},  which  is  discussed  before  in  this  section, 
is  denoted  by  /^  n.  The  new  information  obtained  by  a  sensor  s  from  sector  {m,  n}  at  time  step  k  is  zero 
if  there  is  no  detection  and  JT(/c,s,m,  s,  m,  n)"1  H(k,s,  m,  n)  if  a  detection  occurs,  where  H(k,  s,m,n) 

is  the  measurement  matrix  and  R(k,s,m,n)  is  the  measurement  covariance  matrix  corresponding  to  the  new 
target.  Note  that  the  position  of  a  new  target  is  assumed  to  be  at  the  center  of  the  sector.  The  probability  of  a 
new  target  detection  event  in  sector  {m,  n}  is  given  by  the  product  of  the  probability  of  detection  i fo(k,  s,  m,  n) 
and  the  probability  of  the  presence  of  a  new  target  7fncw(A:,  m,n)  in  this  sector,  i.e., 

7TD,ncw(fc,  s,  m,  n)  =  7 tD(k,  $,  m,  n)7rncw(fc,  m,  n)  (10) 

where  the  term  ^(/c,  s,  m,  n)  is  similar  to  ^(A:,  s,j).  The  only  difference  is  that  the  former  is  a  function  of 
the  sensor  range  from*  the  center  of  a  sector  while  the  latter  is  a  function  of  the  range  between  the  sensor  and 
target.  Note  that  here  the  factor  for  the  range  rate  is  considered  to  be  unity  as  no  realistic  assumption  of  this 
quantity  can  be  made  for  an  undetected  target.  The  second  term  in  the  right  hand  side  of  (10),  which  represents 
probability  of  presence  of  a  new  target,  is  defined  in  Section  3. 

As  discussed  before,  the  expected  information  matrix  from  sector  {m,  n}  when  scanned  by  sensor  s  at  time 
step  k  is  taken  as 

Im.n(k,s)  =  I^ln+TtDinrw(k,s.m,n)H(k,s,Tn,n)'R(k,s,m,n)~lH(k,s,m,n)  (11) 

In  this  work,  in  order  to  restrict  more  than  one  UAV  from  moving  toward  the  same  unscanned  sector,  only  the 
maximum  information  form  all  sensors  is  included  in  the  cost  function  instead  of  considering  the  summation  of 
information,  i.e., 

im,n(k)  =  max  In  \Im,n(k.s)\  (12) 

The  UAVs  are  assumed  to  be  equipped  with  GMTI  sensors  and  the  state  vector  of  target  j  is  taken  as 

X  =  [  Xs  vi  v{  ]  (13) 

where  x,  y  are  the  position  components  in  Cartesian  coordinates  and  rx,  vy  are  the  velocity  components.  The 
measurement  vector  comprises  of  x-y  position  and  radial  velocity  r.  The  measurement  matrix,  of  target  j 
corresponding  to  sensor  s,  is  given  by 

10  0  0 
0  0  10 

0  cosa(fc,s,jf)  0  sina(fc,  s,j) 


H{k,s,j)  = 


(14) 


where  ot(k,s,j)  is  the  azimuth  angle  of  the  target  j  measured  by  sensor  s  at  time  tk .  The  original  position 
measurements  are  in  the  form  of  range  r(fc,  s,  j)  and  azimuth  a(k ,  s,  j )  in  the  presence  of  noise,  which  are  converted 
to  position  using  the  standard  conversion  [1].  The  original  position  measurements  in  range,  azimuth  angle 
and  range  rate  contains  independent  additive  Gaussian  noise  and  the  corresponding  noise  variances  are  given  by 
<x2,  a2  and  cr?,  respectively.  After  conversion,  the  measurement  covariance  matrix  R(k,s,j)  is  given  by 


where  (skipping  the  arguments) 


R(k,s,j) 


R\,i  R\,2  0 

R\,2  ^2,2  0 

0  0  crj 


R\,\  =  r2<r2  sin  a2 -f  <r2  cos  a2 

R22  =  r2<r2  cosq2  -f  sina2 

R\}2  —  (<r2  -  r2a2)  sin  a  cos  a 


(15) 


(16) 

(17) 

(18) 


As  shown  in  [19]  one  can  use  the  observed  azimuth  in  (14)  to  allow  a  linear  model  for  the  range  rate 
measurement.  Note  that  the  computation  of  R(k,  s,m,n)  is  similar  to  that  of  R(k,  s,j).  For  the  former,  the 
position  of  target  is  assumed  to  be  at  the  center  of  the  sector. 

For  a  GMTI  radar,  if  the  magnitude  of  the  measured  value  of  the  range  rate  for  a  target  falls  below  a  threshold 
fmin  t^ien  target  will  not  be  detected.  Hence,  7r£>(A:,  s,  j)  is  given  by 

*l>(k,s,j)  =  1  -  P{— rmin  <  r(k,s,j)  <  rmin  \  r(k,sj\k  -  l).af(k,s,j\k  -  l)2}  (19) 

where  r(/c,  s,  j)  is  the  measured  range  rate  and  r(k,  s,  j\k  —  1)  is  the  predicted  range  rate  given  by 

r(k,  s,  j\k  -  1)  =  vl(k\k  -  1)  cos  a(k,  s,j\h-  1)  4-  v^(k\k  -  l)sina(A:,s,j,  \k  -  1)  (20) 

where  vl(k\k  —  1),  v^(k\k  —  1)  are  the  components  of  predicted  velocity  and  a(it,  s,  j,  |/c  —  1)  is  the  predicted 
azimuth  angle  for  target  j . 

The  variance  term  Of{k,  s,  j\k  —  l)2  in  (19)  is  the  range  rate  measurement  prediction  variance  which  is  the 
third  diagonal  term  of  the  innovation  covariance  matrix 

S(k,s,j)  =  Hfas^Pjiklk-  l)H(k,s,j)f  +  R(k,s,j)  (21) 

where  in  H(k,  s,j)  the  measured  azimuth  angle  a(/c,  s,  j)  is  replaced  by  the  predicted  azimuth  angle  a(k,  s,j\k—  1). 

Hence  ^(A;,  s,j)  in  (19)  is  evaluated  by  integrating  a  Gaussian  density  with  mean  r(k,  s,  j\k  —  1)  and  variance 
<jf(k,  s,j\k  —  l)2  in  the  interval  —  rm^n  to  T-m[n  and  subtracting  the  result  from  unity.  This  is  available  from  the 
tabulated  standard  error  function. 


3.  ALGORITHM 

In  this  section,  the  algorithm  used  by  each  UAV,  which  results  in  cooperative  control  of  the  UAVs  as  a  group,  is 
discussed  in  detail.  This  algorithm  is  capable  of  tracking  detected  targets  and  to  search  for  undetected  ones  by 
taking  into  account  that  new  targets  can  start  from  the  regions  already  been  scanned.  The  surveillance  region  is 
divided  into  sectors  and  it  is  assumed  that  each  UAV  is  capable  of  scanning  Ns  sectors  in  each  time  period  T.  In 
this  algorithm  the  decisions  on  path  selection  and  sectors  to  scan  are  taken  separately  by  the  UAVs.  However, 
these  decisions  are  indirectly  connected  as  a  scan  in  a  particular  sector  reduces  the  information  available  from 
this  sector  which  in  turn  affects  the  next  path  decision. 

In  this  work  the  probability  of  existence  of  a  new  target  in  a  sector  {m,  n},  which  is  required  to  compute 
information  from  new  sectors  as  shown  in  (10),  is  assumed  to  have  the  following  form 


7T ncw(£,  m ,  n)  =  7Tmax(TO:  n)  ( 1  -  e  (tfc  5m  "(fc))/A”* 


(22) 


Figure  1.  Flow  chart  of  the  tasks  of  each  UAV. 

where  tk  is  the  scan  time  at  step  k ,  7rmax(m,  77)  is  the  maximum  value  of  the  probability  of  new  target,  sm,n  is 
the  equivalent  last  scan  time  and  A miTl  is  a  parameter  that  defines  the  rate  of  increase  of  the  probability  of  a 
new  target  after  a  scan  in  sector  {m,  n}.  This  models  the  "‘appearance”  of  a  new  target  since  the  last  scan  of 
the  sector  under  consideration.  The  parameters  7fmax(ra, n)  and  Am,n  are  application  dependent  which  can  be 
chosen  to  give  different  importance  to  different  sectors.  The  equivalent  last  scan  time  sm(n  is  updated  after  a 
scan  by  sensor  s  as  follows 

{Sm,n(k)  if  not  scanned 

tk  if  scanned  and  target  detected  (23) 

tk  —  A(fc,  s,  77i,  n,  s)  if  scanned  and  target  not  detected 

where  /c+  denotes  time  immediately  after  the  scan  at  tk  and  A (k,  s,  m,  n)  is  given  by 

A(k,  s,  771, 71 )  =  —  Am  n  In  Jl—  ^1  —  e~(tk~  ”  [1  -  nD(ki  5,77i,7i)]J 


(24) 


In  this  algorithm  each  UAV  stores  the  equivalent  last  scan  time  of  each  sector  to  keep  track  of  the  new  target 
probability,  which  needs  update  only  if  the  corresponding  sector  is  scanned.  The  derivation  of  (24)  is  discussed 
in  Appendix. 

Each  UAV  performs  the  tasks  shown  in  Figure  1  asynchronously  w.r.t.  the  other  UAVs.  Each  UAV  scans  Na 
sectors  at  an  interval  of  T  using  a  GMTI  sensor.  The  decision  about  which  sectors  to  scan  is  taken  separately 
from  the  path  decision  of  this  UAV  and  it  does  not  depend  on  the  state  of  the  other  UAVs.  The  information 
from  each  section  is  computed  and  the  best  Ns  sections  are  chosen,  on  that  basis,  for  this  scan. 

A  function  similar  to  (1)  is  used  for  search  decision.  The  only  difference  is  that  the  information  contribution 
from  the  other  sensors  is  not  considered.  If  there  are  Vm?n(/:)  targets  in  sector  {m,n}  at  time  step  k  then  the 
modified  information,  corresponding  to  UAV  s,  form  this  sector,  is  given  by 

AU.nM 

/m,n(fc,s)  =  ln|P,(k|fc- l)-1  +ns(h,  «)«•£>(*,  s,j)H(k,s,j)'R(k,s,j)~lH(k,s,j)\ 

j=i 

—  In  \Pj(k\k  —  l)-1 1  (25) 

where  Pj(k\k  —  1)  is  the  error  covariance  matrix  corresponding  to  the  predicted  state  of  target  j ,  H(k,  6,  j)  is 
the  measurement  matrix,  s,  j)  is  the  measurement  covariance  matrix,  ns(k,  s)  is  the  survival  probability  of 
UAV  s  and  7T£>(fc,  s,  j)  is  the  target  detection  probability.  If  sector  {m,  n)  does  not  contain  any  of  the  currently 
scanned  targets  then  modified  information  from  this  sector  is  given  by 

4n,n(M)  =  ln  Cin  +  KD<nCw{k,  s,  m,  n)H (k,  s,m,  n)' R(k,  s,  m,  ti)*1  H (k,  s,m,  n)\  -  In  |/^,n  |  (26) 

where  n  is  the  prior  information  for  a  sector  without  target,  nz>^ncw(k,  s.m.n)  is  the  new  target  detection 
probability,  H(k,  s,  m,  n)  is  the  measurement  matrix  and  R(k,s,  m,  n)  is  the  measurement  covariance  matrix  for 
a  target  at  the  center  of  the  sector. 

After  scanning  N9  sectors  for  targets,  the  detections,  equivalent  last  scan  time  matrix  and  UAV’s  current  state 
are  transmitted  to  the  other  UAVs.  There  is  a  7 rc  probability  that  another  UAV  will  receive  this  transmission. 
Each  UAV  maintains  its  own  set  of  target  tracks,  which  are  updated  when  a  new  set  of  detections  is  either 
obtained  by  the  corresponding  UAV  or  received  from  another  UAV. 

After  scanning  for  measurements,  transmitting  them  and  updating  the  target  tracks,  each  UAV  determines 
its  path  for  the  next  interval  T.  For  a  coordinated  operation  this  decision  depends  on  the  corresponding  UAVs 
knowledge  about  the  current  locations  of  the  other  UAVs,  the  current  state  of  the  target  tracks  maintained  by 
it  and  equivalent  last  scan  times  of  the  sectors.  The  objective  function  J  in  (1)  is  maximized  to  obtain  the 
paths  of  the  UAVs.  In  this  sense,  each  UAV  works  like  a  central  node  which  increases  the  total  computational 
requirement  of  the  system.  However,  this  way  the  decentralized  system  avoids  possible  online  negotiation  which 
can  be  disastrous  in  the  presence  of  communication  problems.  Also,  for  a  large  system,  computational  load  of 
each  UAV  can  be  reduced  by  considering  only  a  small  region  around  it. 

The  knowledge  of  the  state  of  another  UAV  is  updated  when  information  is  received  from  that  UAV.  In  case 
of  a  failure  in  communication,  the  path  decided  for  the  particular  UAV  in  the  last  iteration  is  assumed  to  be  its 
actual  path.  If  no  information  is  received  from  a  particular  UAV  for  a  number  of  times,  the  corresponding  UAV 
is  considered  to  be  lost  and  future  decisions  are  taken  without  considering  it. 

After  setting  its  course  for  next  interval  T  each  UAV  then  waits  for  any  transmission  from  the  other  UAVs 
as  shown  in  Figure  1.  If  a  new  set  of  detections  and  scan  times  are  received  from  another  UAV,  then  the  target 
tracks,  maintained  by  this  UAV,  are  updated  and,  also,  the  equivalent  last  scan  times  are  updated,  where  the 
new  scan  time  for  a  sector  is  the  maximum  of  the  old  scan  time  and  the  received  scan  time.  After  T  seconds  the 
UAV  once  again  scans  Ns  sectors  with  its  GMTI  sensor  and  so  on. 


Figure  2.  The  paths  of  six  targets  in  the  surveillance  region  (x  -  initial  positions). 

4.  SIMULATION  RESULTS 

In  this  section,  we  present  the  simulation  results  obtained  for  a  58  min  scenario  in  which  the  surveillance  region 
is  40  km  x  40  km  and  it  includes  6  targets  as  shown  in  Figure  2.  The  targets,  which  include  maneuvering  and 
move-stop- move  types,  appear  in  the  surveillance  region  at  different  times.  The  first  three  targets  start  at  the 
beginning  of  simulation  (time  0).  Target  1  moves  in  a  straight  line  on  a  course  of  90°  (parallel  to  the  x-axis) 
with  a  speed  10  m/s  and  it  exits  the  surveillance  region  at  42  min.  Target  2  moves  at  a  speed  7.3  m/s  on  a  course 
of  75°  and  turns  30°  to  the  right  at  time  33  min  20  s  and  then  moves  straight.  Target  3  moves  straight  on  a 
course  of  45°  with  a  speed  7.1  m/s.  Targets  2  and  3  continue  to  move  inside  the  surveillance  region  during  the 
whole  simulation.  Target  4  enters  the  surveillance  region  at  25  min  close  to  the  the  upper  left  comer  with  a 
speed  lOm/s  on  a  course  of  180°  and  performs  a  90°  right  turn  at  38 min  20s  and  then  continues  straight.  It 
exits  the  surveillance  region  at  55  min  32  s.  Target  5  moves  in  the  central  part  of  the  surveillance  region  from 
16  min  40  s  to  41  min  40  s  and,  also,  it  stops  for  5  min  at  23  min  20  s.  Target  6  moves  in  the  upper  right  part  of 
the  surveillance  region  from  33  min  20  s  to  46  min  40  s  and  turns  35°  in  during  a  maneuver.  For  all  the  targets 
the  turn  rates  of  the  maneuvers  are  l°/s. 

The  number  of  UAVs  deployed  in  different  simulations  vary  between  3  to  6  and  each  of  them  starts  at  y 
position  of  -12  km  while  keeping  a  distance  of  1km  from  the  closest  ones  along  the  x  direction.  Initially,  the 
UAVs  move  at  a  rate  of  30  m/s  on  a  course  of  360°  (along  the  +y  direction).  For  this  simulation,  T  is  5s,  which 
means  each  UAV  performs  the  set  of  tasks  shown  in  Figure  1  within  that  time.  The  surveillance  region  is  divided 
into  4  km  x  4  km  sectors  and  it  is  assumed  that  each  UAV  can  scan  10  such  sectors  each  time.  The  UAVs  start 
with  no  knowledge  about  the  targets  and  each  UAV  decides  on  its  path  by  maximizing  J  in  (1)  based  on  its 
knowledge  of  the  positions  of  the  other  UAVs,  target  tracks  and  last  scan  times  of  the  sectors  known  to  the 
corresponding  UAV.  The  success  probability  of  a  communication  between  two  UAVs,  which  is  denoted  by  nc  in 
Section  3,  is  0.9. 

In  this  simulation  one  point  track  initialization  [19]  is  applied  and  track  maintenance  is  performed  by  a 
two  stage  procedure:  measurement  to  track  association,  which  is  performed  by  the  auction  algorithm  [4],  and 
track  update  using  a  Kalman  filter.  A  white  noise  acceleration  model  is  assumed  for  the  targets  with  process 
noise  standard  deviation  (s.d.)  being  lm/s2.  The  measurement  noise  s.d.  are  ar  =  10  m,  =  10  3  rad  and 
Gf  —  1  m/s.  The  number  of  false  alarms  in  each  sector,  when  scanned,  follows  a  Poisson  distribution  with  mean 
0.1  and  the  false  measurements  are  uniformly  distributed  in  the  sector. 

Commonly  for  the  tracking  algorithms  presented  in  the  literature,  a  track  is  deleted  if  it  is  not  associated 
with  measurements  for  more  than  a  predetermined  number  of  updates.  However,  this  rule  is  not  based  on  the 


observability  criterion  and  may  result  in  deletion  of  tracks  because  they  are  unobservable  by  the  sensors  used. 
To  avoid  this,  in  this  work,  a  quantity  7rtrack  is  updated  each  time  a  set  of  detections  is  received,  as  follows 


TTtrack  (k  +  1)  =  |  7FtraCk(fc)1(1  “  ^ 


if  not  associated 
otherwise 


(27) 


where  7 vD  is  the  probability  of  detection  of  the  particular  track.  If  7rtrack  falls  below  a  predetermined  level,  10“  6 
in  this  simulation,  the  track  is  deleted  from  the  track  list. 

In  this  simulation,  the  altitude  of  the  UAVs  above  the  ground  is  considered  to  be  1  km  and  the  ground  is 
considered  to  be  flat.  The  UAVs  fly  at  a  constant  speed  of  30  m/s  and  can  perform  coordinated  turns  with 
angular  turn  rate  upto  l°/s.  To  set  its  path  for  the  next  period,  each  UAV  decides  its  angular  turn  rate  by 
maximizing  J  in  (1).  Since  J  also  includes  the  angular  turn  rates  of  the  other  UAVs,  these  quantities  are  also 
determined  in  a  joint  maximization  procedure.  The  Matlab  function  ‘fmincon?  is  used  to  perform  this  constrained 
optimization.  The  angular  turn  rates  of  the  other  UAVs  are  not  transmitted  as  each  UAV  performs  this  operation 
independently. 

For  each  sector  {m,  n}  the  UAV  management  algorithm  needs  to  assume  the  rate  at  which  the  probability 
of  a  new  target  increases  if  the  sector  is  not  scanned,  denote  by  Am,n  in  (22),  and  the  maximum  value  of  the 
probability  of  a  new  target,  denoted  by  7fmax(ra,  n).  In  this  simulation  these  parameters  are  assumed  to  be  5  min 
and  10“4,  respectively,  for  all  sectors. 
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Figure  3.  Detection  and  survival  probabilities  assumed  in  this  simulation. 

Figure  3  shows  the  survival  probabilities  and  detection  probability  as  a  function  of  range,  as  used  in  this 
simulation.  It  can  be  seen  that  the  survival  probabilities  increase  as  the  target-sensor  distance  and  sensor-sensor 
distance  increase.  However,  the  detection  probability  w.r.t.  range  increases  as  the  distance  between  target-sensor 
decreases.  It  is  important  to  note  that  in  a  real  life  scenario  the  survival  and  detection  probabilities  depend 
on  the  terrain  topography  and  the  algorithm  proposed  in  this  work  is  applicable  for  any  assumption  on  these 
probabilities.  The  detection  probability  factor  as  a  function  of  the  range  rate  of  a  target  w.r.t.  a  sensor  is  a  step 
function  being  1  if  the  target  range  rate  magnitude  is  more  than  2m/s  and  0  otherwise. 

Figures  4  and  5  show  8  snapshots  in  a  typical  simulation  using  5  UAVs  where  triangular  shapes  represent 
the  positions  of  the  UAVs  and  their  directions  of  motion  are  along  the  sharp  comers.  In  these  figures  black  lines 
represent  current  tracks  and  blue  stars  represent  the  last  updated  position  of  the  corresponding  targets.  Also, 
the  yellow  patches  denote  sectors  scanned  by  the  UAVs  in  the  last  period  and  the  blue  patches  denote  sectors 
not  scanned  for  more  than  1  min.  Note  that  in  this  case  the  equivalent  last  scan  time  is  used  as  discussed  in 
Section  3.  The  first  of  the  snapshots,  which  shows  the  positions  of  the  tracks  and  UAVs  at  7  min,  consists  of  a 
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Figure  4.  The  first  four  snapshots  showing  UAV  positions,  target  tracks  and  sectors  scanned  in  a  typical  simulation 
using  five  UAVs. 


T>n*  n  35  mfriute* 


Ttete  *  42  minutes 


Figure  5.  The  last  four  snapshots  showing  UAV  positions,  target  tracks  and  sectors  scanned  in  atypical  simulation  using 
five  UAVs. 

large  number  of  blue  sectors  because  the  UAVs,  which  start  near  the  lower  left  comer,  have  not  yet  reached  these 
sectors.  From  the  next  snapshot  onwards  there  are  very  few  blue  sectors  as  each  sector  is  repeatedly  scanned 
even  if  none  of  the  current  targets  belong  there.  Figure  6  shows  the  total  paths  of  the  UAVs,  which  start  from 
the  lower  left  of  the  surveillance  region,  and  the  tracks  obtained  by  UAV  l.2  In  this  figure  it  can  be  seen  that 
all  of  the  targets  are  tracked  for  their  complete  duration  of  path  inside  the  surveillance  region,  after  they  are 
detected. 


2Note  that  in  our  simulations  due  to  imperfect  communication  tracks  obtained  by  different  UAVs  are  not  identical. 
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Figure  6.  The  complete  path  of  the  UAVs  and  the  tracks  of  targets  in  the  typical  simulation  using  5  UAVs. 


5.  CONCLUSIONS 

In  this  paper,  a  novel  cooperative  control  algorithm,  for  a  number  of  UAVs  searching  and  tracking  multiple 
targets  in  a  large  region,  is  presented.  According  to  this  algorithm  each  UAV  broadcasts  its  current  scan  and 
detection  information  and  decides  its  path  separately  according  to  an  information  theoretic  objective  function, 
which  incorporates  target  state  information  as  well  as  target  detection  probability  and  UAV  survival  probabilities 
due  to  hostile  fire  by  targets  and,  also,  due  to  collision  with  other  UAVs. 

A  simulated  scenario  consists  of  a  40  km  x  40  km  surveillance  region,  6  targets,  which  includes  maneuvering 
and  move-stop-move  types,  and  3-6  UAVs,  in  different  cases.  The  results  show  that  the  cooperative  control 
algorithm  enables  the  UAVs  to  maintain  almost  similar  average  accuracy  for  different  tracks.  Also,  as  the 
number  of  UAVs  increase  the  target  state  (containing  position  and  velocity  components)  estimation  RMSE  and 
target  detection  delays  for  all  tracks  decrease  while  the  continuity  of  tracks  improve. 

Appendix 

The  derivation  of  the  update  of  equivalent  scan  time  sm>n  for  sector  {m,  n}  is  discussed  in  this  section.  The 
probability  that  there  is  a  undetected  target  after  a  scan  by  sensor  s  in  sector  {m,  n}  is  equal  to  the  product  of 
the  probability  of  a  new  target  in  this  sector  before  the  scan  and  the  probability  of  no  detection  in  this  sector, 
i.e. , 

TTncw  (fc+  ,m,n)  =7 r  ncw(k,  my  n)  (1  -  s,  m,  n))  (28) 

where  denotes  the  time  immediately  after  the  scan  at  tk  and  7rncw(k,Tn,  n)  is  given  in  (22).  As  discussed 
before  the  probability  of  detection  corresponding  to  a  sector  is  assumed  to  be  equal  to  that  probability  at  the 
center  of  the  sector.  Next,  7rnew(A:'1",  m,n)  can  be  expressed  in  terms  of  the  updated  equivalent  last  scan  time  in 
sector  {m,  n}  as  follows 

TTnew (fc+,m,n)  =  7fmax(™,™)  (l  “  e_(tfc‘Sm  ri(fc+))/Am  n^  (29) 

The  equation  (24)  is  obtained  by  substituting  7rnew (fc+,m,  n)  by  (28)  and  then  solving  for  sm,n(fc+). 
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ABSTRACT 

In  this  paper  we  propose  a  cooperative  control  algorithm  for  a  group  of  UAVs  carrying  out  surveillance  — 
search,  tracking  and  classification  —  over  a  large  region  which  includes  a  number  of  targets.  The  goal  is  to 
track  and  classify  detected  targets  as  well  as  search  for  new  targets.  The  UAVs  are  assumed  to  be  equipped 
with  Ground  Moving  Target  Indicator  (GMTI)  radars,  which  measure  the  locations  of  moving  ground  targets 
as  well  as  their  radial  velocities  (Doppler).  In  addition,  a  classification  sensor  is  mounted  on  each  UAV  that  can 
obtain  target  class  information.  The  surveillance  region  is  divided  into  a  number  of  sectors  and  it  is  assumed 
that  the  GMTI  sensor  on  each  UAV  scans  a  fixed  number  of  such  sectors  in  each  period  of  its  operation.  The 
sensor  responsible  for  class  information  can  scan  only  a  small  circular  region  around  the  predicted  position  of 
a  target.  In  this  paper,  a  decentralized  cooperative  control  algorithm  is  proposed,  according  to  which  each 
UAV  transmits  the  current  scan  information  (either  kinematic  or  class  information)  and  detection  information 
(including  ’negative  information1 )  to  the  other  UAVs.  Each  UAV  makes  its  scan  decision  and  path  decision 
separately,  based  on  information-based  objective  functions,  which  incorporate  target  state  information  as  well 
as  target  detection  probability  and  survival  probability  due  to  possible  hostile  fire  by  targets  and  collision  with 
other  UAVs.  The  proposed  algorithm  requires  limited  communication  and  modest  computation  and  it  can  handle 
failure  in  communication  and  loss  of  UAVs. 

Keywords:  target  classification,  sensor  management,  cooperative  control,  UAV  path  planning,  ground  target 
tracking. 


1.  INTRODUCTION 

A  number  of  UAV  management  algorithms  can  be  found  in  the  literature.  In  [14]  a  cooperative  control  algorithm 
for  multiple  UAVs  to  simultaneously  reach  a  predetermined  target  location,  which  maximizes  the  survivability 
of  the  UAVs  due  to  exposure  to  threats  while  adhering  to  fuel  constraint,  is  addressed.  A  hierarchical  decision 
mechanism  is  proposed  in  which  at  team  level  the  estimated  time  until  arrival  is  computed  and  at  UAV  level 
path  planning  is  performed.  In  [3]  a  similar  approach,  based  on  Voronoi  diagram  in  path  planning,  is  used  for 
the  simultaneous  intercept  problem  in  the  presence  of  dynamic  threats.  Similar  approaches  can  be  found  in 
[6,  15].  In  [10]  another  hybrid  control  structure  is  proposed  for  the  simultaneous  intercept  problem  in  which 
UAV- to- target  allocation  and  time  to  reach  the  targets  are  decided  in  the  central  node,  while  the  path  decisions 
are  taken  locally  in  UAVs. 

In  [17]  a  multi- vehicle  path  planning  problem  in  hostile  environment  is  solved  by  dynamic  programming. 
In  this  centralized  algorithm  path  decisions  are  taken  in  terms  of  connecting  pre-defined  waypoints.  In  [18]  a 
decentralized  cooperative  search  algorithm  is  proposed  in  which  UAVs  exchange  environment  information  but 
independently  decide  their  paths  by  minimizing  a  convex  combination  of  costs  associated  with  subgoals.  It  can 
be  noted  that  for  UAV  management  problems  decentralized  algorithms  are  preferred  as  they  have  advantages  of 
graceful  degradation,  scalability  and  modularity  properties  over  the  centralized  ones.  In  [9],  for  a  search  problem, 
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dynamic  programming  is  used  to  make  finite  horizon  decisions.  Note  that  our  problem  includes  search,  tracking 
and  classification. 

In  [8]  a  decentralized  sensor  management  algorithm,  based  on  maximizing  information  gain,  is  presented. 
An  online  negotiation  arrangement  between  the  sensors  is  used  to  decide  the  assignment  of  known  targets  to 
generate  a  coordinated  sensor-platform  trajectory  for  obtaining  information  about  static  features  (targets).  In  [5] 
a  decentralized  multi-UAV  search  algorithm,  for  a  lost  vehicle  in  a  stationary  state,  is  proposed.  This  algorithm  is 
synchronous,  according  to  which  each  UAV  negotiates  with  others  by  communicating  the  expected  measurement 
likelihood  for  its  best  path  given  the  measurement  likelihood  of  other  UAVs. 

An  important  objective  of  the  surveillance  system  is  to  classify  the  detected  targets.  The  problem  of  optimal 
search  for  joint  detection  and  classification  of  a  stationary  target  was  first  considered  in  [21],  which  assumes 
separate  sensors  for  these  objectives.  It  is  shown  that  the  optimal  plan  is  to  allocate  classification  effort  immedi¬ 
ately  after  targets  are  detected.  In  [7]  continuous  search  of  multiple  targets,  which  belong  to  multiple  classes,  is 
considered,  where  the  probability  of  detection  and  classification  is  a  function  of  the  search  effort.  In  [12]  design 
of  search  patterns  in  the  presence  of  false  contacts  is  presented  where  classification  is  modeled  as  am  uncertain 
but  instantaneous  process.  In  [6]  a  cooperative  classification  algorithm  for  stationary  targets  is  developed  in 
which  a  hierarchial  path  planning  is  performed  to  view  complementary  aspect  angles. 

In  [16]  kinematic  information  is  combined  with  the  feature  information  obtained  from  high  range  resolution 
(HRR),  inverse  synthetic  aperture  radar  (ISAR),  and  synthetic  aperture  radar  (SAR)  signatures,  to  obtain 
improved  classification  and  association  for  tracking  moving  ground  targets.  An  integrated  classification  and 
tracking  algorithm  is  presented  in  [2].  In  our  proposed  algorithm  the  classification  and  tracking  are  considered  as 
different  problems  as  it  is  assumed  that  the  GMTI  sensor  mounted  on  UAVs  does  not  provide  any  classification 
information  and  the  classification  sensor  does  not  provide  reliable  position  information. 

In  our  previous  work  [19,  20]  a  decentralized  cooperative  control  algorithm  is  developed  for  a  number  of  UAVs, 
equipped  with  GMTI  sensors,  to  track  multiple  ground  targets  and  search  for  new  ones  in  a  specified  region.  The 
region  is  divided  into  a  number  of  square  sectors2  and  it  is  assumed  that  each  UAV  can  scan  Ns  sectors  each  time. 
In  the  proposed  algorithm  the  UAVs  exchange  the  information  about  the  sectors  that  were  scanned  and  yielded 
no  detections  (“negative  information”)3,  the  measurements  obtained  and  their  current  kinematic  states.  Each 
UAV  decides  on  the  sectors  it  scans  and  the  path  it  follows  on  its  own.  In  this  work  these  decisions  are  decoupled 
into  two  separate  problems  —  scan  decision  and  path  decision  —  to  make  the  surveillance  problem  tractable. 
In  both  cases,  the  corresponding  objective  functions  are  based  on  information  gain.  The  objective  functions 
incorporate  information  from  detected  targets  as  well  as  possible  information  from  the  yet  undetected  targets, 
which  are  included  in  the  form  of  possible  targets  in  the  sectors.  Also,  the  objective  functions  incorporate  the 
detection  probabilities  of  targets,  which  are  based  on  both  range  and  range  rate,  and  the  survival  probabilities  of 
the  UAVs  due  to  hostile  fire  from  targets  and  possible  collision  with  other  UAVs,  in  computing  the  information 
for  a  particular  target-sensor  geometry. 

In  our  current  work  classification  of  detected  targets  is  considered  as  an  additional  objective  of  the  cooperative 
control  algorithm.  It  is  assumed  that,  along  with  the  GMTI  sensor,  each  UAV  carries  a  classification  sensor,  for 
example  a  CCD  camera.  This  additional  sensor  sends  its  information  to  a  classifier  block  that  outputs  one  of 
the  target  classes.  The  probabilities  of  observation  of  different  outputs  given  different  actual  classes  of  targets 
form  the  class  confusion  matrix.  This  matrix  is  assumed  to  be  known  for  each  value  of  targe t-UAV  distance 
and  the  classification  accuracy  increases  as  this  distance  decreases.  In  this  work  it  is  assumed  that  classification 
with  any  degree  accuracy  can  be  obtained  from  any  particular  direction,  i.e.,  spatial  diversity  does  not  improve 
classification  results. 

The  objective  function  for  path  planning,  in  general,  is  a  weighted  summation  of  the  total  kinematic  informa¬ 
tion,  discussed  in  [20],  and  the  total  classification  information  obtainable.  However,  to  facilitate  fast  classification, 
the  information  contribution  of  a  UAV  closest  to  a  detected  target  is  equal  to  the  information  obtainable  from 
this  target  until  it  is  classified  to  a  predefined  level  of  accuracy.  After  that,  this  target’s  importance  in  path 

2 Note  that  it  is  possible  to  incorporate  any  other  shapes  of  sectors  in  this  algorithm. 

3 Information  about  sectors  that  yielded  no  detection  decreases  the  expected  information  content  of  the  corresponding 
sectors.  Hence  such  information  is  denoted  as  “negative  information”. 


planning  is  de-emphasized.  At  the  same  time  a  target  which  is  already  classified  to  a  particular  level  of  certainty 
is  retained  longer  in  the  case  of  no  detection  as  the  tracker  has  higher  confidence  in  the  track  corresponding  to 
the  target. 

The  objective  function  for  scan  decision  is  similar  to  the  one  discussed  in  [20]  with  the  addition  that  a  UAV 
decides  to  use  the  classification  sensor  when  the  predicted  classification  information  gain  is  more  than  a  threshold. 
It  is  assumed  that  a  UAV  can  use  only  one  type  of  sensor  at  a  time. 

The  paper  is  organized  as  follows.  The  objective  functions  for  scan  decision  and  path  planning  is  discussed  in 
Section  2.  In  Section  3  the  cooperative  control  algorithm  is  described  in  detail.  Simulation  results  are  presented 
in  Section  4  and  Section  5  presents  the  concluding  remarks. 

2.  THE  OBJECTIVE  FUNCTIONS 

One  of  the  most  important  stages  in  the  development  of  a  sensor  management  algorithm  is  the  choice  of  objective 
functions  which  should  be  easy  to  optimize  and  the  decisions  based  on  these  functions  should  enable  the  algorithm 
to  achieve  the  sensor  management  goals.  In  this  section  we  introduce  the  objective  functions  which  are  used  for 
the  scan  decisions  and  path  decisions. 

In  this  work  definition  of  information  follows  the  entropic  information  measure.  For  a  random  variable  x 
with  probability  density  function  /(x)  the  entropic  information  measure  is  given  by 

£{ln(/(x))>  =  f  /(x)ln(/(x))dx  (1) 

J  —  OC 

If  x  is  a  Gaussian  random  vector  with  information  matrix  I  (inverse  of  its  covariance)  then  the  entropic  infor¬ 
mation  is  given  by 

IG(x)  =  Iln((27re)"|/|)  (2) 

where  n  is  the  dimension  of  x. 

If  x  is  a  discrete  random  variable  then  the  entropic  information  is  given  by 

Id(x)  =  ^P(xi)lnP(xi)  (3) 

t=l 

where  xts  are  N  discrete  levels  of  x  and  P(Xi)s  are  the  probabilities  of  those  levels. 

2.1.  Kinematic  Information 

In  this  work  computation  of  the  kinematic  information  matrix  (inverse  covariance)  follows  [20].  Since  it  is 
convenient  to  work  with  a  form  in  which  the  total  matrix  information  is  the  summation  of  the  information 
from  different  (independent)  sources4,  in  this  work  we  use  the  following  expression  of  the  expected  updated 
information  corresponding  to  target  j  at  time  step  k  (conditioned  on  the  information  at  k  —  1) 

Ij(k\k)  =  Ij(k\k  -  l)  +  'Y^ns(k,s)nD{k,  s,j)H(k,  s,j)'  R(k,  s,j)~1H(k,s,j)  (4) 
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where  Ij(k\k  —  1)  is  the  information  matrix  (inverse  covariance)  corresponding  to  the  predicted  state,  which 
depends  on  the  target  motion  model  (see,  e.g.,  [1]  ch.  5).  The  combined  probability  that  UAV  s  contributes  to 
the  kinematic  information  about  target  j  at  time  k  is  given  by  ns{k,  s)7Td(A;,  s,  j),  where  ^(/c,  s)  is  the  survival 
probability  of  UAV  s  and  n £>(fc,  s,  j)  is  the  probability  of  detection  of  target  j  by  GMTI  sensor  on  UAV  s  given 
survival  of  the  corresponding  UAV.  The  kinematic  information  obtained  by  a  particular  UAV  s  about  target  j  at 
time  k  is  given  by  H(k,  s,j )'R{k,  s,  j)~lH (k,  s,  j)  where  H(k,  s,  j)  is  the  measurement  matrix  and  R(k,  s,  j)  is  the 

4 This  statement  is  true  for  already  detected  targets  only.  For  sectors  without  detected  target  the  total  matrix  informa¬ 
tion  is  maximum  of  the  matrix  information  obtainable  by  any  sensor.  This  avoids  all  UAVs  from  moving  to  a  particular 
sector. 


measurement  noise  covariance  matrix  corresponding  to  the  sensor-target  pair  at  time  k.  The  expected  updated 
information  corresponding  to  target  j,  when  only  UAV  s  participates  in  kinematic  information  gathering,  is  given 
by 

hj(k\k)  =  Ij(k\k  -  1)  +7rs(fc,  s)TTD{k,s,  j)H(k,  s,j)'R(k,s,j)~lH(k,  s,j)  (5) 

The  detection  probability  of  target  j  by  the  GMTI  sensor  on  UAV  s  at  time  step  k  given  the  survival  of  the 
UAV  is  written  as 

*D{k,s,j)  =TTXD(k,s,j)iT2D{k,s,j)  (6) 

where  ^(fc,  s,  j)  is  the  detection  probability  (of  target  j  by  sensor  s  at  time  k)  as  a  function  of  the  range  and 
is  the  detection  probability  as  a  function  of  the  range  rate.  The  first  term  depends  on  the  specific 
application  and  the  second  term  is  the  probability  that  the  range  rate  measured  by  the  GMTI  sensor  is  higher 
than  the  “minimum  detectable  velocity”  (MDV)  (see  [19]  for  detailed  discussion). 

The  total  survival  probability  of  UAV  s  is  equal  to  the  product  of  target- fire  survival  probability  7r$(/:,  s)  and 
collision  survival  probability  7r| (k,s)  of  this  UAV,  i.e., 

*s(k,  s)  =  7 r^(fc,  s)7r|(fc,  s)  (7) 

where  7r ^(A;,  s),  in  turn,  is  the  product  of  target-fire  survival  probabilities  of  UAV  s  in  view  of  each  target,  i.e., 

”s(fc- s) = n**.  sJ)  (s) 

j 

and  7r|(/c,  s)  is  the  product  of  collision  survival  probabilities  corresponding  to  all  other  UAVs 

7r|(fc,s)  =  ?r|(fc,  s,i)  (9) 

The  dependence  of  these  probabilities  on  the  sensor  to  target  distances  (and  possibly  other  factors)  is  application 
dependent. 

The  surveillance  region  is  divided  into  a  number  of  sectors  and  for  the  sectors  that  do  not  contain  any  of  the 
currently  tracked  targets,  new  (undetected)  targets  are  the  possible  source  of  information.  The  prior  kinematic 
information  for  sector  {m,  n}  is  denoted  by  n,  where  the  uncertainty  is  the  same  as  that  of  a  target  that  can 
be  anywhere  in  the  sector  and  move  at  any  possible  velocity.  The  new  kinematic  information  obtained  by  UAV 
s  from  sector  {m,  n}  at  time  step  k  is  zero  if  there  is  no  detection  and  H(k ,  $,m,  n)'R(k ,  $,  m,  n)~lH(k ,  s,  m,  n) 
if  a  detection  occurs,  where  H(k<  s,  m,  n)  is  the  measurement  matrix  and  R(k ,  s,  m,  n)  is  the  measurement  noise 
covariance  matrix  corresponding  to  the  new  target.  The  position  of  a  new  target  is  assumed  to  be  at  the  center 
of  the  sector.  The  probability  of  a  new  target  detection  event  in  sector  {m,n}  is  given  by  the  product  of  the 
probability  of  detection  7f £>(/:,  s,  m,  n)  and  the  probability  of  the  presence  of  a  new  target  (with  range  rate  above 
the  MDV)  7rnCv/{k,m,n)  in  this  sector,  i.e., 

^D,  new  (/c,  S,  771,  71 )  —  ^"£)(^?  5,  777,  7l)7Tn(%w(/c,  771,  7l)  (19) 

where  the  term  noik,  s,  m,  n)  is  similar  to  7 r^(/c,  s,  j).  The  only  difference  is  that  the  former  is  a  function  of  the 
GMTI  sensor  range  from  the  center  of  a  sector  while  the  latter  is  a  function  of  the  range  between  the  sensor  and 
target. 

The  second  term  in  the  right  hand  side  of  (10),  which  represents  the  probability  of  presence  of  a  (moving) 
new  target  in  sector  {m,  n}  at  time  step  k,  is  assumed  to  have  the  following  form 

7Tnew(A:,  TTl ,  7 1)  =  7 Tmax(m,  7l)  (l  -  (11) 

where  tk  is  the  scan  time  at  step  /c,  7fmax(77i,  n)  is  the  maximum  value  of  the  probability  of  new  target,  tm,n  is 
the  equivalent  last  scan  time  and  Amin  is  a  parameter  that  defines  the  rate  of  increase  of  the  probability  of  a  new 
target  after  a  scan  in  sector  {m,  n}  yielded  no  detection.  This  models  the  “appearance”  of  a  new  target  since 


the  last  scan  of  the  sector  under  consideration.  The  parameters  7rmax(m,rc)  and  AmiU  are  application  dependent 
which  can  be  chosen  to  give  different  importance  to  different  sectors. 

In  view  of  the  above,  the  expected  kinematic  information  matrix  from  sector  {m,n}  when  scanned  by  the 
GMTI  sensor  of  UAV  s  at  time  step  k  is 

Im,n(k,  s)  =  I^  n  +  7 T£>,new(/:,  s,  m,  n)H(k,  s,  m,  n)! R(k,  s,  m,  n)~lH{k ,  s,  m,  n)  (12) 


Entropic  information  can  be  computed  from  the  above  mentioned  information  matrices  by  using  (2).  Note 
that  in  our  work  target  state  estimates  and  predictions  are  assumed  to  be  Gaussian  distributed. 


2.2.  Classification  Information 

The  class  probability  vector  corresponding  to  target  j  before  the  scan  at  time  k  is  denoted  as  fij(k  —  1)  and 
the  classifier  confusion  matrix  corresponding  to  the  target  j  and  classification  sensor  on  UAV  s  is  denoted  by 
C(k,s,j).  An  element  Cab(k,  $,j)  of  the  classifier  matrix  is  defined  as  the  probability  of  the  event  that  the 
classifier  output,  if  target  j  is  scanned  by  the  classification  sensor,  is  C(k,s,jO  =  b  given  that  the  true  class  is 
Kj  =  a,  i.e., 

cab{k,s,j)  =  P{({k,sJ)  =  b\Kj  =  a)  (13) 


If  the  classifier  output  is  6,  then  the  updated  class  probability  is  given  by  [2] 


Hj(k\Csj(k)  =  b)  = 


Cb{k,  s,j)'nj(k  —  1) 


(14) 


where  Cb{k,s,j)  denotes  the  6th  column  of  the  class  confusion  matrix  and  0  is  the  Schur-Hadamard  product 
(term  by  term).  The  corresponding  classification  information  l\j,  fc|£(fc,  s,  j)  =  b]  about  target  j  at  time  step  k 
can  be  computed  using  (3). 

The  expected  updated  class  information  of  target  j  at  time  step  k  can  be  obtained  considering  all  possible 
combinations  of  the  target’s  true  class  and  classifier  output  in  UAV  s  as  follows 

l(k,s,j)  =  E{l(j,k)} 

=  £Zb\*lC(MJ)  =  &]P{<(MJ)  =  &}  (15) 

6 


where  the  probability  that  the  observation  is  6,  is  given  by 

S,j)  =  b}  =  ^Co6(fc,s,j>“(fc-  1)  (16) 

a 


where  n^{k  —  1)  is  the  probability  that  target  j  belongs  to  class  a  given  the  information  at  k  —  1. 

2.3.  The  Objective  Function  for  Scan  Decision 

The  expected  information  gain  from  the  sectors  is  used  as  the  criterion  for  the  scan  decisions  made  by  each  UAV. 
It  is  assumed  that  while  the  GMTI  sensor  can  scan  Ns  sectors,  the  classification  sensor  can  only  scan  one  small 
circular  region  of  radius  r8.  Furthermore,  only  one  sensor  on  each  UAV  can  perform  scan  operation  during  one 
time  period  T. 

Firstly  expected  classification  information  gain  is  computed  for  each  target.  The  target  corresponding  to  the 
maximum  predicted  classification  information  gain  is  chosen  for  a  scan  by  the  classification  sensor  only  if  the 
gain  is  more  than  a  threshold.  Defining 

g3  =  max{T(k,s,j)  —I(k  —  1,  j)}  and  ws  =  argmax{T(fc,  s,  j)  -  l{k  -  1,  j)}  (17) 

i  J 

target  ws  is  scanned  for  the  class  information  by  UAV  s  if  gs  >  c 9 ,  where  c9  is  a  threshold  and  T(k  —  1,  j)  is  the 
class  information  corresponding  to  target  j  at  time  step  k  —  1. 


If  a  UAV  decides  against  using  the  classification  sensor  then  the  kinematic  information  gain  from  the  sectors 
is  computed  in  the  following  manner.  If  there  are  Nm%n(k)  targets  in  sector  {m,n}  at  time  step  k  then  the 
expected  kinematic  information  gain  by  the  GMTI  sensor  on  UAV  s  from  this  sector  is  given  by 

Nm.n(fc) 

■C?(M)  =  £  ln|/,J-(fc|fc)|-ln|/j(fc|fc-l)|  (18) 

i=  i 

where  Ij(k\k  —  1)  is  the  predicted  information  matrix  and  is  the  expected  updated  information  matrix 

corresponding  to  target  j,  when  this  target  is  scanned  by  sensor  s,  as  defined  in  (5).  Note  that  in  (18)  each  term 
in  the  summation  denotes  expected  kinematic  information  gain  corresponding  to  a  target  if  the  sector  {m,n}  is 
scanned. 

If  sector  {m,  n)  does  not  contain  any  of  the  currently  detected  targets  then  the  expected  kinematic  information 
gain  from  searching  this  sector  is  given  by 

=  ln|/m,n(M)|  -  ln|/°,n|  (19) 

where  n  is  the  prior  information  matrix  for  sector  {m,  n}  and  Im,n{k,  s )  is  the  expected  updated  information 
matrix  corresponding  to  sector  {m,n},  when  scanned  by  sensor  s,  as  defined  in  (12).  The  best  Ns  sectors  in 
terms  of  objective  function  s)  are  chosen  for  scan  by  the  GMTI  sensor  on  UAV  s  at  time  step  k . 

2.4.  The  Objective  Function  for  Path  Decision 

The  detected  targets,  which  form  the  set  are  categorized  into  two  sets:  the  set  containing  targets  which  are 
classified  well  (denoted  as  T\)  and  the  set  containing  the  remaining  targets  (denoted  as  T%).  A  target  is  considered 
to  be  classified  well  if  the  maximum  class  probability  is  higher  than  a  threshold.  To  facilitate  fast  classification 
of  each  target  in  set  the  path  decision  of  the  UAV  closest  to  it  is  dictated  by  the  information  obtainable  from 
the  particular  target.  Such  dedicated  UAVs  form  set  A  and  the  remaining  UAVs  form  set  B.  While  information 
gain  from  all  targets  and  all  sectors  is  considered  in  path  decision  of  UAVs  in  set  B,  information  gain  from  only 
the  closest  target  in  set  7-2  is  considered  in  path  planning  of  UAVs  in  set  A  (this  avoids  conflicts). 

The  information  gain  achievable  by  the  sensors  related  to  detected  or  undetected  targets  under  independence 
is  considered  to  be  the  (global)  objective  function  for  the  path  decision,  i.e., 

Jrpath(fc)  =  £  7j  (ln|/,(fc|&)|  —  ln|Jy(fc|fc  -  1)|  +  »7  (max{t(k,s,j)  —  I(k—  l,i)})) 

J67b 

+  £  (ln|/mi„(*)|— ln|i*,n|)  (20) 

{m,n}eSjr, 

where  Ij(k\k  —  1)  and  Ij(k\k)  are  the  predicted  and  expected  updated  information  matrices  at  time  step  k  for 
target  j  within  the  set  of  detected  targets,  denoted  by  Tp.  These  information  matrices  are  defined  above.  The 
quantities  l(k  —  1,  j)  and  T(k,  s,  j)  are  the  classification  information  about  target  j  at  time  step  k  —  1  and 
expected  classification  information  if  the  target  is  scanned  by  the  classification  sensor  on  UAV  s,  respectively. 
Note  that  UAV  s  in  set  A  is  considered  in  computation  of  Ij(k\k)  only  if  j  €  T2  and  UAV  s  is  its  closest.  The 
relative  weight  between  the  kinematic  and  classification  information  is  r/,  which  is  a  design  parameter.  The 
maximum  of  the  classification  information  gain  for  all  UAVs  is  included  in  (20)  as  only  one  UAV  is  expected  to 
obtain  class  information  of  a  target.  To  facilitate  the  detection  and  classification  of  new  targets,  7j  takes  less 
than  unity  value  if  the  target  j  is  in  set  T\.  Otherwise  the  value  of  is  1. 

In  (20)  Sq  is  the  set  of  all  sectors  in  which  there  are  no  currently  tracked  targets  and  /m,n(fc)  is  the  expected 
information  matrix  of  new  targets  in  sector  {m,  n}  of  the  surveillance  region  given  by 

=  4n,n(fc,  s)  where  $  =  argmax  |/m,n(fc,  s) \  (21) 

8&A 

where  /m,n(fc,  s)  is  the  expected  information  matrix  from  sector  {m,  n)  when  scanned  by  sensor  s  at  time  step 
k,  which  is  defined  in  (12).  Classification  information  gain  for  undetected  targets  is  not  included  in  the  objective 


function.  Note  that  in  order  to  restrict  more  than  one  UAV  from  moving  toward  the  same  unscanned  sector,  only 
the  maximum  information  from  all  sensors  is  included  in  the  cost  function  instead  of  considering  the  summation 
of  information. 

In  our  problem  the  UAVs  scan  the  targets  asynchronously  which,  in  general,  leads  to  a  complicated  objective 
function.  However,  if  the  offset  times  are  small,  as  in  our  case,  the  target-sensor  geometry  does  not  change 
much  from  the  scan  by  one  UAV  to  that  by  another  one.  Hence,  for  convenience,  the  objective  function  for  path 
decision  is  constructed  assuming  synchronous  scans  by  all  sensors.  Note  that,  the  algorithm  described  in  this 
paper  provides  a  solution  to  a  one-step  ahead  planning  problem .  It  is  shown  to  be  effective,  while  requiring  much 
less  computation  than  a  longer  horizon  planning,  which  is  currently  under  investigation. 


3.  THE  ALGORITHM 

In  this  section,  the  algorithm  used  by  each  UAV  which  results  in  cooperative  control  of  the  UAVs  as  a  group, 
is  discussed  in  detail.  This  algorithm  is  capable  of  tracking  and  classifying  detected  targets  and  to  search  for 
undetected  ones  by  taking  into  account  that  new  targets  can  start  in  sectors  that  have  already  been  scanned.  In 
this  algorithm  the  decisions  on  path  selection  and  sectors  to  scan  are  taken  separately  by  the  UAVs.  However, 
these  decisions  are  indirectly  connected  as  a  scan  in  a  particular  sector  reduces  the  information  available  from 
this  sector  which  in  turn  affects  the  next  path  decision. 

Each  UAV  performs  the  following  tasks  asynchronously  w.r.t.  the  other  UAVs.  Each  UAV  makes  the  scan 
decision  as  discussed  in  Section  2.3.  If  the  UAV  decides  to  use  the  classification  sensor  it  scans  a  circular  region 
of  radius  rs  around  the  predicted  positon  of  the  intended  target.  Otherwise,  the  UAV  scans  N8  sectors  by  using 
its  GMTI  sensor.  The  expected  information  gain  from  each  sector  is  computed  using  (18)  for  sectors  containing 
currently  tracked  targets  (track  maintenance)  and  using  (19)  for  sectors  which  do  not  contain  any  of  the  currently 
tracked  targets  (search).  The  best  Ns  sectors  are  chosen,  on  that  basis,  for  this  scan.  These  scan  decisions  by 
each  UAV  are  taken  independently  of  the  state  of  the  other  UAVs. 

The  equivalent  last  scan  time  smtTl  is  updated  after  a  scan  by  the  GMTI  sensor  on  UAV  s  as  follows 


•-<**>  *  { t- 


(k)  if  not  scanned 


A(/c,  s,  m,  n,  s)  if  scanned  and  target  not  detected 
where  k+  denotes  time  immediately  after  the  scam  at  tk  and  A(/c,  s,  m,  n)  is  given  by 

A(/c,  s,m,  n)  =  -Am,n  In  1-  ^1  —  [l  —  jtD(k,  s,m,  n)]j 


(22) 


(23) 


In  this  algorithm  each  UAV  stores  the  equivalent  last  scan  time  im,n(k)  of  each  sector  to  keep  track  of  the  new 
target  probability,  which  needs  update  only  if  the  corresponding  sector  is  scanned.  The  derivation  of  (23)  is 
discussed  in  [19]. 

Next,  the  measurements,  which  include  classifier  output  in  case  of  a  scan  by  the  classification  sensor  and 
detections  if  GMTI  sensor  is  used,  the  equivalent  last  scan  time  matrix  and  the  UAV’s  current  state  are  trans¬ 
mitted  to  the  other  UAVs.  There  is  a  nc  probability  that  another  UAV  will  receive  this  transmission.  Each  UAV 
maintains  its  own  set  of  target  tracks,  which  are  updated  when  a  new  set  of  detections  is  either  obtained  by  the 
corresponding  UAV  or  received  from  another  UAV. 

After  scanning  for  measurements,  transmitting  them  and  updating  the  target  tracks,  each  UAV  determines 
its  path  for  the  next  interval  T.  For  a  coordinated  operation  this  decision  depends  on  the  corresponding  UAV’s 
knowledge  about  the  current  locations  of  the  other  UAVs,  the  track  picture  maintained  by  it5  and  the  equivalent 
last  scan  times  of  the  sectors.  The  objective  function  JP*th  in  (20)  is  maximized  to  obtain  the  path  of  the 
UAV.  Since  Jpath  depends  on  the  future  positions  all  of  the  UAVs,  each  UAV  requires  to  optimize  it  w.r.t.  the 
paths  of  all  UAVs  and  then  executes  its  own  part  of  the  solution.  In  this  sense,  each  UAV  works  like  a  central 
node  and  this  way  the  decentralized  system  avoids  possible  online  negotiations  which  can  be  disastrous  in  the 

5Note  that  each  UAV  updates  its  track  picture  using  measurements  obtained  by  it  and  those  received  from  other  UAVs. 
The  track  picture  may  be  different  from  UAV  to  UAV  because  of  imperfect  communication. 


presence  of  communication  problems.  For  a  large  system,  the  computational  load  of  each  UAV  can  be  reduced 
by  considering  only  a  certain  region  around  it.  Note  that  the  proposed  algorithm  makes  the  assumption  that  the 
actions  (turn  rates)  computed  by  different  UAVs  are  almost  the  same  due  to  the  similarity  of  their  information 
about  the  current  track  picture. 

The  knowledge  of  the  state  of  another  UAV  is  updated  when  information  is  received  from  that  UAV.  In  case 
of  a  failure  in  communication,  the  path  decided  for  the  particular  UAV  in  the  last  iteration  is  assumed  to  be  its 
actual  path.  If  no  information  is  received  from  a  particular  UAV  for  a  number  of  times,  the  corresponding  UAV 
is  considered  to  be  lost  and  future  decisions  are  taken  without  considering  it.  However,  if  transmissions  are  once 
again  received  from  a  “lost”  UAV  it  is  included  in  the  future  decisions. 

After  setting  its  course  for  next  interval  T  each  UAV  then  waits  for  any  transmission  from  the  other  UAVs. 
If  a  new  set  of  detections  and  scan  times  are  received  from  another  UAV,  then  the  target  tracks,  maintained 
by  this  UAV,  are  updated  and,  also,  the  equivalent  last  scan  times  are  updated,  where  the  new  scam  time  for  a 
sector  is  the  maximum  of  the  old  scan  time  and  the  received  scan  time.  After  T  seconds  the  UAV  once  again 
makes  its  scan  decision  and  so  on. 


4.  SIMULATION  RESULTS 

In  this  section,  we  present  the  simulation  results  obtained  for  a  58  minute  scenario  in  which  the  surveillance 
region  is  40  km  x  40  km  and  it  includes  6  targets.  The  targets,  which  include  maneuvering  and  move-s top- move 
types,  appear  in  the  surveillance  region  at  different  times.  A  detailed  discussion  on  targets’  paths  is  provided  in 
[!»]- 

The  number  of  UAVs  deployed  is  4  and  each  of  them  starts  at  y  position  of  —12  km  while  keeping  a  distance 
of  1  km  from  the  closest  ones  along  the  x  direction.  Initially,  the  UAVs  move  at  a  speed  of  30  m/s  on  a  course 
of  0°  (along  the  +y  direction).  For  this  simulation,  T  is  5s,  which  means  each  UAV  performs  the  set  of  tasks 
described  in  Section  3  within  that  time.  The  surveillance  region  is  divided  into  4  km  x  4  km  sectors  and  it  is 
assumed  that  the  GMTI  sensor  on  each  UAV  can  scan  5  such  sectors  each  time.  Each  UAV  can  scan  a  circular 
region  of  radius  100  m,  which  is  previously  denoted  by  r5,  by  using  the  classification  sensor.  The  UAVs  start 
with  no  knowledge  about  the  targets  and  each  UAV  decides  on  its  path  by  maximizing  JpAth  in  (20)  based  on 
its  knowledge  of  the  positions  of  the  other  UAVs,  target  tracks  and  last  scan  times  of  the  sectors  known  to  the 
corresponding  UAV.  The  success  probability  of  a  communication  between  two  UAVs,  which  is  denoted  by  nc  in 
Section  3,  is  0.9. 

In  this  simulation  one  point  track  initialization  is  applied  and  track  maintenance  is  performed  by  a  two  stage 
procedure:  measurement  to  track  association,  which  is  performed  by  2-D  assignment  via  the  auction  algorithm 
[4],  and  track  update  using  a  Kalman  filter.  A  discretized  continuous-time  white  noise  acceleration  model  [1]  is 
assumed  for  the  targets  with  process  noise  power  spectral  density  being  1  m2/s3.  The  measurement  noise  s.d. 
corresponding  to  the  GMTI  sensors, are  aT  —  10  m,  oq  —  10~3  rad  and  oy  =  1  m/s.  The  number  of  false  alarms  in 
each  sector,  when  scanned,  follows  a  Poisson  distribution  with  mean  0.1  and  the  false  measurements  are  uniformly 
distributed  in  the  sector6.  As  discussed  in  Section  2.4  a  target  is  considered  to  be  classified  well  if  its  maximum 
of  the  class  probabilities  is  higher  than  a  threshold,  which  in  this  simulation  is  0.9.  In  the  objective  function 
for  path  decision  (20)  the  relative  weight  between  kinematic  and  classification  information  gain  is  denoted  by 
77.  Although  in  this  simulation  spatial  diversity  of  classification  information  is  ignored,  target  the  detection 
probability  by  GMTI  scan  as  a  function  of  range  rate  is  directional.  Hence  classification  information  can  be 
obtained  from  any  direction  but  the  same  is  not  true  for  kinematic  information.  For  this  reason  in  this  work  77 
is  chosen  to  be  equal  to  0.1  to  emphasize  the  kinematic  information.  For  the  targets  that  are  classified  well,  the 
constant  7 j  that  emphasizes  the  corresponding  information  gain  is  reduced  to  0.1  to  facilitate  the  detection  and 
classification  of  new  targets. 

Commonly  for  the  tracking  algorithms  presented  in  the  literature,  a  track  is  deleted  if  it  is  not  associated 
with  measurements  for  more  than  a  predetermined  number  of  updates.  However,  this  rule  is  not  based  on  the 
observability  criterion  and  may  result  in  deletion  of  tracks  because  they  are  unobservable  by  the  sensors  used. 

6 While  the  UAV  management  does  not  explicitly  include  false  alarms,  it  is  shown  to  operate  successfully  also  in  their 
presence. 


To  avoid  this,  in  this  work,  track  quality  index  denoted  by  7rtrack  is  updated  each  time  a  set  of  detections  is 
received,  as  follows 


7Ttrack(fc+l)=  |  ~  ^ 


if  not  associated 
otherwise 


(24) 


where  7td  is  the  probability  of  detection  of  the  particular  track.  If  the  track  quality  index  falls  below  a  prede¬ 
termined  level  of  10~6  for  a  target  that  is  classified  well  and  10~3  otherwise,  the  track  is  deleted.  Note  that 
the  targets  that  are  classified  well  are  retained  longer  in  the  event  of  missed  detection  as  the  tracker  is  more 
confident  about  such  a  track.  Also,  tracks  were  confirmed  only  after  obtaining  3  measurements  for  them.  In  our 
simulations  not  a  single  false  track  is  confirmed. 


In  this  simulation,  the  altitude  of  the  UAVs  above  the  ground  is  considered  to  be  1  km  and  the  ground  is 
considered  to  be  flat.  The  UAVs  fly  at  a  constant  speed  of  30  m/s  and  can  perform  coordinated  turns  with 
angular  turn  rate  up  to  6°/s.  To  set  its  path  for  the  next  period,  each  UAV  decides  its  angular  turn  rate  by 
maximizing  Jpath  in  (20).  Since  JPath  corresponding  to  the  information  matrix  based  algorithm  depends  on 
the  future  positions  of  the  other  UAVs  which  in  turn  depend  on  their  turn  rates  for  the  next  period,  the  turn 
rates  of  all  UAVs  are  determined  in  a  joint  maximization  procedure.  The  Mat  lab  function  Tmincon’  is  used  to 
perform  this  constrained  optimization.  The  angular  turn  rates  of  the  other  UAVs  are  not  transmitted  as  each 
UAV  performs  this  operation  independently. 

For  each  sector  {m,  n}  the  UAV  management  algorithm  needs  to  assume  the  rate  at  which  the  probability 
of  a  new  target  increases  if  the  sector  is  not  scanned,  denoted  by  Am>n  in  (11),  and  the  maximum  value  of  the 
probability  of  a  new  target,  denoted  by  7rmax(^7^)-  In  this  simulation  these  parameters  are  assumed  to  be  5  min 
and  10~4,  respectively,  for  all  sectors.  A  UAV  is  considered  lost  by  another  UAV  if  there  is  no  information  from 
the  former  UAV  for  5  consecutive  periods.  In  our  simulations  none  of  the  UAVs  was  incorrectly  assumed  to  be 
lost,  i.e.,  there  was  no  instance  of  five  consecutive  transmission  failures  from  one  UAV  to  another. 


Figure  1.  Components  of  the  class  confusion  matrix  vs.  target-UAV  distance. 


The  same  survival  and  detection  probabilities  as  a  function  of  range  are  used  as  in  [19].  The  detection 
probability  factor  as  a  function  of  the  range  rate  of  a  target  w.r.t.  a  sensor  is  a  step  function  being  1  if  the  target 
range  rate  magnitude  is  more  than  2  m/s  and  0  otherwise.  In  this  simulation  the  classifier  output  belongs  to  a 
set  of  three,  same  as  the  track  class.  The  class  confusion  matrix  has  the  following  form 


C  = 


fi(r)  f2(r)  h(r) 
f4(r)  fi(r)  f4(r) 
f3(r)  h{r)  fi(r) 


(25) 


where  r  is  the  distance  between  the  target  and  the  classification  sensor  in  the  UAV.  Figure  1  shows  the  elements 
of  C  as  a  function  of  r.  The  minimum  information  gain  for  a  classifier  sensor  scan,  denoted  by  ws  in  Section  2.3, 
is  0.2. 
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Figure  2.  Snapshots  at  the  simulation  times  when  target  1  and  2  are  classified  with  satisfactory  level  of  accuracy. 


xflcm) 


-5  0  5  10  15  20  23  30  35  40  45 

x(km) 


(a)  Target  3  daasffied 


(b)  Target  5  daestted 


Figure  3.  Snapshots  at  the  simulation  times  when  target  3  and  5  are  classified  with  satisfactory  level  of  accuracy. 

Figures  2-4  show  snapshots  in  a  typical  simulation  using  four  UAVs  which  follow  information  based  objective 
functions  for  path  planning  and  scan  decision.  The  snapshots  are  taken  at  the  moments  when  different  targets 
are  being  classified  with  intended  level  of  accuracy.  The  triangular  shapes  represent  the  positions  of  the  UAVs 
and  their  directions  of  motion.  In  these  figures  the  black  lines  represent  current  tracks  and  the  stars  represent 
the  last  updated  position  of  the  corresponding  targets.  Also,  the  yellow  (grey)  patches  denote  sectors  scanned 
by  the  GMTI  sensors  in  the  last  period  and  the  blue  (dark)  patches  denote  sectors  not  scanned  for  more  than 
1  min.  Note  that  in  this  case  the  equivalent  last  scan  time  is  as  in  (22).  The  smaller  red  (dark)  patches  show  the 
target  positions  where  the  classification  sensors  are  used.  It  can  be  seen  that  while  some  UAVs  are  on  a  mission 
to  classify  the  targets  fast,  the  remaining  UAVs  cover  the  rest  of  the  region  for  search  and  tracking. 

Figure  5  shows  the  complete  path  of  the  UAVs  and  the  tracks  obtained  by  UAV  1  during  the  simulation.  It 
can  be  seen  that  UAVs  perform  well  in  terms  of  fast  target  detection  and  tracking.  In  addition  all  six  targets 
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Figure  4.  Snapshots  at  the  simulation  times  when  target  4  and  6  are  classified  with  satisfactory  level  of  accuracy. 


Figure  5.  The  complete  path  of  the  UAVs  and  the  tracks  of  targets  in  the  typical  simulation  using  4  UAVs  which  follow 
the  information  based  algorithm. 

are  classified  with  the  intended  level  of  accuracy. 

5.  CONCLUSIONS 

In  this  paper,  a  novel  cooperative  control  algorithm,  for  a  number  of  UAVs  searching  for  targets  in  a  large 
region,  tracking  and  classifying  them,  is  presented.  According  to  this  algorithm,  each  UAV  broadcasts  its 
current  scan  and  detection  information  and  decides  on  its  path  separately  according  to  an  in  formation- based 
objective  function,  which  incorporates  target  state  information  as  well  as  target  detection  probability  and  UAV 
survival  probabilities  due  to  hostile  fire  by  targets  and,  also,  due  to  possible  collision  with  other  UAVs. 

A  simulated  scenario  that  consists  of  a  40  km  x  40  km  surveillance  region,  and  6  targets,  which  include 
maneuvering  and  move-stop-move  targets  has  been  shown.  The  results  from  a  typical  run  show  that  4  UAVs  are 
able  to  detect  and  track  all  targets.  In  addition,  all  of  the  targets  are  classified  to  the  intended  high  accuracy. 
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